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Preface 


This compendium will be the curriculum text for the course “Mathematical 
Logic II” at the University of Oslo, Spring 2006. This compendium is a revised 
version of the compendium used for the same course in the Spring 2004. A few 
details are added, some typos are corrected, some exercises that were unsolvable 
has been rewritten and other minor details have been changed. 

In its present form, this compendium may be used free of charge by anyone, but 
if someone uses it for an organized course, the author would like to be informed. 


Blindern, 13-12-05 


Dag Normann 


Introduction 


This compendium is written primarily as a text for the course Mathematical 
Logic II given at the University of Oslo, Norway. However, much more stuff 
is included than what is needed for this; we have included material that is of 
interest to the logic group in Oslo, and that the reader might like to learn by 
self study. The course Mathematical Logic II is a continuation of the course 
Mathematical Logic, and we assume that the reader has passed this course or is 
familiar with logic up to the same level. In the Appendix we have collected some 
propositions without proofs. These are precise versions of theorems assumed to 
be known to the reader. 

We will assume that the reader is familiar with first order logic at an ele- 
mentary level, including the soundness theorem, the deduction theorem and the 
completeness theorem for countable theories. We will assume knowledge of the 
Henkin-style proof of the completeness theorem. The theorem of constants will 
be used extensively. At some occasions we will use the full completeness theo- 
rem, though a full proof use Zorn’s Lemma or comparative means. Since most 
intended readers also will follow a course in axiomatic set theory, the verification 
of Zorn’s Lemma from the Axiom of Choice is left for that course. 

This compendium is essentially consisting of two parts, Model Theory and 
Computability Theory. In model theory we mainly restrict ourselves to the clas- 
sical topics like quantifier elimination and omitting/realizing types, but have 
included a brief introduction to finite model theory. In the computability the- 
ory part we use a Kleene-style introduction, and give an introduction to the 
recursion theorem, c.e. sets, Turing degrees, a basic priority argument and the 
existence of minimal degrees. We also include a chapter on generalized com- 
putability theory. 

Now we will settle some terminology and conventions that we will use in the 
text. Our terminology and our conventions are chosen to be compatible with 
the beginners course in logic at the University of Oslo, but our terminology is 
sufficiently standard for this to cause no problems for readers with a different 
background. 


e L will be a first order language. Unless specified otherwise, L will be a 
language with equality. 


© (1, C2 etc. will be names (or constant symbols) in a formal language. 


e f,g, etc. will be function symbols in a formal language. Sometimes it will 
be convenient to consider a name as a function symbol of arity 0. 


e R, P, Q etc. will be relation symbols in a formal language. 


@ v1, V2,... is the formal list of variables, but we will mainly use x, y etc. 
instead. 


e ty, to etc. will denote terms. 


o, w, €, etc. will denote formulas or sentences. 


e A structure 2 for a language L will consist of a domain A and interpre- 
tations I* for each name, function symbol and relation symbol 1 ( 1 for 
‘letter’) in L. Whenever we use the Gothic letters 2 and 8, we will use the 
corresponding roman letters A and B for the domains of these structures 
without always mentioning it. 


e s will denote an assignment function, a function assigning an element of 
a structure 2 to each variable v;. ¢[s] will be used intuitively as the 
formula ¢ where all free variables are replaced by names for elements in 
A according to the assignment s. 


Technically we define 
sg] = c4, 


ve[s] = 5(2), 


cI 


(fer ...tn)™[s] = fs], MIs). 
A formula ¢ will technically be interpreted as a function ¢* from the set 
of assignments to the set B = {T, L} of truth values. How this is done is 


well known from any beginners course in logic. 


e We will use the terminology t{777'" and ¢f17°" for simultaneous in- 


vot 
stansiations of the terms t,,...,t» for the free occurrences of the variables 
%1,...,%p. In the latter case we assume without saying that the terms 


are substitutable for the variables in question. 


e Since the notation above is hard to read, we will sometimes write 


(£1,.-.,2n) 
when ¢ is a formula with at most 21,...,@, free. When we use this 
notation, we may write 
P(ti,..-,tn) 
instead of 
oPeee, 


We have chosen to use the notation 
{- 7 “} 
instead of the commonly used 
sb aes 
because in some expressions we would otherwise have a multiple purpose use of 
| that might cause confused readers. 


Each chapter 1-4 is supplied with a set of exercises at the end, some simple 
and some hard. The exercises are integrated parts of the text, and at the end 
the students are assumed to have worked through most of them. 

Chapter 5 will consist of just exercises. These are not integrated parts, but 
challenges the eager reader might like to face. 


Chapter 1 


Classical Model Theory 


In model theory we investigate the connection between particular first order 
theories or first order theories of a special kind and the class of models for that 
theory. The completeness theorem is one of the most basic tools in model theory, 
and some of the applications will refer to the actual proof of the completeness 
theorem. We will also, without hesitation, use the axiom of choice. 


1.1 Embeddings and isomorphisms 


In this section we will discuss more systematically ways of comparing structures 
of a 1. order language L. Some of these concepts may be known to the reader, 
but for the sake of completeness we include all relevant definitions. 


Definition 1.1.1 Let LZ bea first order language, 2 an L-structure with domain 
A. 


A substructure 8 with domain B will be an L-structure such that 
1 BCA. 
2. c® =c™ whenever c is a name (constant symbol) in L. 


3. f?(a1,.-.,an) = f*(a1,...,an) whenever f is a function symbol in L of 
arity n and aj,...,dan € B. 


4, R® = R* 4 B” whenever R is a relation symbol in L of arity n. 


Definition 1.1.2 A first order theory T is open if all non-logical axioms are 
open, i.e. have no quantifiers. 


Lemma 1.1.3 Let 2% be an L-structure. 


a) Let BC A. Then B is the domain of a substructure B of UA if and only if 
B is closed under all interpretations f* where f is a function symbol (or 
a constant) in L. 


b) Let B be a substructure of A. Let d be an open L-formula and s an 
assignment over B (and then also over I.) Then ¢[s] is true in B if and 
only if it is true in A. 


Proof 
The proof of a) is trivial and the proof of b) is trivial by induction on the 
subformula relation. 


Given a structure 2, it is sufficient to consider the class of substructures for 
most purposes where this is relevant. However, sometimes we want to go the 
other way, we might like to extend a structure. This is done e.g. in moving 
from the natural numbers to the integers, further on to the rationals and maybe 
continuing to the algebraic numbers, the real numbers or the complex numbers. 
Whenever convenient we may consider a natural number as a complex number, 
though using a standard representation of complex numbers in set theory, there 
is a long way to go. In the extreme, a natural number will be a finite set 
totally ordered by the €-relation, an integer will be an equivalence class of 
pairs of natural numbers, a rational number will be an equivalence class of 
pairs of integers, a real will be a Dedekind cut of rational numbers (or even 
worse, an equivalence class of Cauchy sequences of rational numbers) and a 
complex number will be an ordered pair of real numbers. Of course there is no 
mathematical gain in thinking of complex numbers as such monsters, it is easier 
to assume that NC ZC RC C and just base our mathematics on our intuitive 
understanding of these sets. 

When we in the context of a logic course want to extend a structure, it is 
standard procedure to use the completeness theorem in such a way that we 
ensure that a copy of the original structure will be a substructure. We have to 
do some set-theoretical hocus pocus in order to obtain an actual extension. 

We will now develop the concepts needed in order to make this precise: 


Definition 1.1.4 let ZL bea first order language and let 2( and 8 be L-structures. 


a) An embedding from % to 2% is a map 7: B — A that is one to one and 


such that 
n(c®) = c® for each name c. 
n(f"(a1,..-,@n)) = f?(x(a1),...,2(an)) for all a1,...,an € B. 
R® (ai,...,an) > R™(x(a1),..-,;7(Gn)) for all a1,...,an € B. 


b) An isomorphism will be an embedding that is onto. 


Examples 1.1.5 a) If we consider (R,0,+,<) and (R*, 1,-, <) as two struc- 
tures for the language with one name, one binary function and one binary 
relation, then 7(x) = e* is an isomorphism. 


b) If we consider reals as sets of Dedekind cuts of rationals with the inherited 
algebraic structure, then m(q) = {r € Q; r < q} will be an embedding of 
Q into R. 


c) If 3 = (Z,+,-,0,1) is seen as a structure for the language of ring theory, 
then the set of non-negative integers with the inherited algebra will be a 
substructure (but not a subring) of 3. 


There are two traditions for defining the interpretation of a formula over a 
structure. One is using assignments as we did in the introduction. Then we 
define the truth value of any formula relative to a given assignment. The other 
tradition is to extend the language with names for each element in the struc- 
ture, and then only interpret the sentences in this extended language. These 
approaches are equivalent, but it turns out that the latter approach is more easy 
to use for many of the purposes in model theory. 


Definition 1.1.6 Let L be a first order language, 2 an L-structure. 


a) By L(2l) we mean the language L where we add one new constant symbol 
Cq for each a € A. 


b) As a default, we will consider 2 as an L(2l)-structure via the interpretation 
M 
co =a. 


See Exercise 1.3 for the tedious, but simple observations that have to be made. 
Recall that a literal is a formula that is either an atomic formula or the negation 
of an atomic formula. 


Definition 1.1.7 Let L be a first order language and let 21 be an L structure. 
By the diagram of 2, D(2), we mean the set of variable-free literals in L(Q) 
that are true in 2. 


Remark 1.1.8 In some textbooks, the diagram is defined as the theory where 
all variable free sentences in L(2l) that are true in 2 are axioms. This will be 
an equivalent theory to ours, so the choice of definition is a matter of taste. 


The diagram will play an important part when we want to show that we have 
extensions of a structure 2 with some desired properties; we just have to show 
that the diagram of 2 is consistent with the desired properties: 


Lemma 1.1.9 Let L be a first order language, 2 and 8 L-structures. 
Then the following are equivalent: 


i) There is an embedding n of & into B. 


ii) We may give an interpretation of the extra constants ca from L(Q) in B 
such that B becomes a model for D(Q). 


Proof 

Assume i). Let (ca)® = (a). With this interpretation, ii) is verified. 

Assume ii). Let 7(a) = (ca). Since L is a language with equality, and 8 is a 
model for the diagram of 21, 7 will be an embedding. 


It will be a matter of taste if we talk about extensions of structures or of 
embeddings into other structures, see Exercise 1.4. Our first result is a typical 
model-theoretical one, we connect the logical form of the axioms in a theory 
with the relations between models and their substructures. Recall that two first 
order theories are equivalent if they have the same theorems. (A consequence 
will be that they are based on the same language; why?) 


Theorem 1.1.10 (Los-Tarski) 
Let T be a first order theory over a language L. Then the following are equiva- 
lent: 


1. For all L-structures A and B, if B is a model for T and °A is a substructure 
of B, then 2 is a model for T. 


2. T is equivalent to an open theory T’. 


Proof 
By the completeness theorem, two theories will be equivalent if and only if they 
have the same models. Thus 2. = 1. is already proved. 
Now assume that 1. holds. Let the non-logical axioms in T’ be the open theorems 
of T.. Then J” is an open theory, and all models for T will be models for T’. In 
order to prove that T and T” are equivalent, we must prove the converse. So, 
let 2 be a model for T’. In order to prove that T’ is a model for T, consider 
the theory 

T* = D(A) UT. 


Claim 

T* is consistent. 

Proof 

Assume not. Then there are ¢1,...,@n in the diagram of 2( such that 

T U {¢1,---, én} is inconsistent, or alternatively, that T+ 7(¢1 A--- A dn). 
Each ¢; will be an instance of some literal 7; in ZL, and we may let the same 
name c, be substituted for the same variable z in all the literals under consider- 
ation. Thus there are variables 21,..., 2m, new names Cg,,...,Cq,, and literals 
W1,---,;Wn in L such that 


bi = (ab jeteere 


Caz sam, 
for each 2 < n. 

The names c,, will not occur in the non-logical axioms in T’, so by the theorem 
of constants, TF a(41 A+++ A dn). 

However, each ¢; is a literal that is true in 2, so 61 A---A@n is a valid instance 
of Wi A+++ A tp. This contradicts that a(~1 A--- A tp) is valid in 2, and the 
claim is proved. 


Let 8 be a model for T*. There will be an embedding of 2 into B. By Exercise 
1.4, 2 is isomorphic to a substructure 2’ of 8, which by the assumption is a 
model for 7. Thus 2 is a model for T. 

This ends the proof of the theorem 


Sometimes in model theory we will construct a structure by piecewise ex- 
tensions, quite often using the completeness theorem at each step. In order to 
handle such constructions we need the concept of directed limit, a concept you 
find in other parts of mathematics as well. 


Definition 1.1.11 Let (J,<) be a partial ordering. 
The ordering is called directed if 


Vie ING € Tak Ee TG<k AG <k). 


A directed ordering is a generalization of an increasing sequence. 


Definition 1.1.12 Let L be a first order language. 


a) A directed system of L-structures will consist of 


1. A directed ordering (J, <) 
2. An L-structure 2; for each i € I 
3. An embedding mj; : 2; — 2; whenever i <j in I 


such that 7;; is the identity function on each 2; and such that whenever 
i<j <k, then mp = 1;% 0 7; (where o denotes composition). 


b) Using the notation from a), a directed limit of the system will be an L- 
structure 2 and embeddings 7; : 2; — 2 for each i € I such that 


—Ifi <j, then Tj = 15 O TG. 

— Let 8 be an L-structure and for each i € J, let n; be an embedding 
from 2; to B such that n; = nj o mi; whenever 7 < j. 
Then there is a unique embedding 7 : 2 — % such that n; = no 7; 
for each i € I. 


Remark 1.1.13 The concept of a directed limit is more general. In a category 
theoretical context it is often known as a colimit. 


Theorem 1.1.14 Each directed system 


(I, <, (Wier, {mij his;) 
will have a directed limit, and this is unique up to isomorphisms. 


Proof 

The reader should verify for her/himself that if 21 and B both satisfy the re- 
quirements of a directed limit, then 2{ and B are isomorphic, see Exercise 1.5. 
Let X be the set of ordered pairs (i,a) such that i € J and a € A; where A; is 
the domain of 2;. 

We let (i,a) & (7,6) if for some k € I we have that i < k, j < k and 
Tik(@) = 15x (0). 
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This definition does not depend on the choice of k, and = will be an equivalence 
relation. 

Let A be the set of equivalence classes [(a,i)], and let 7;(a) = [(a, 2)]. 
Whenever C),...,C, are equivalence classes, we may find an 7 € I and 
@1,.+-,@y € A; such that Cj = [(¢,a;)] for each 7 < n. Here we use that (J, <) 
is a directed ordering. 

We let 


f (Cineue Ce) = (GF Gi assaa))| 


and 


whenever f and RF are n-ary. 
This definition does not depend on i. 
If B and 7; are as in Definition 1.1.12 b), then 7 defined by 


n([(a, 2)]) = m(@) 


will be the unique embedding satisfying 7; = 0 7; for each 7 € I. 
The details left out and the rest of the proof is left for the reader as Exercise 
1.5. 


1.2 Elementary embeddings 


If two structures 2 and % for the same language L are isomorphic, they will 
have the same logical properties. In fact, this must be true as long as a logical 
property reflects the structure itself and not the underlying set-theoretical rep- 
resentation of the domains or of other parts of the interpretations. In particular, 
they will share the same properties expressible in higher order logic. Sometimes 
a concept is defined to be logical if it is invariant under isomormhisms. In the 
same spirit, we may call a concept algebraic if it is invariant under embeddabil- 
ity. It is not certain that all researchers in algebra will agree. 

Concepts like finite, well ordering, countable and complete are logical in this 
sense, but they are not expressible in first order logic. In order to make a 
discussion of this precise, we need a new concept: 


Definition 1.2.1 Let L be a first order language, and let 2 and B be two 
L-structures. 
We say that 2 and % are elementary equivalent if 


AE GSBES 


whenever ¢ is a sentence in L. 
Since it is useful to consider a structure as the same structure even if we in- 
terpret several additional names, we refer to L in the notation for elementary 
equivalence. We write 

A= oF Da) 


when 2 and % are elementary equivalent with respect to sentences in L. 


11 


In the beginning of the Twentieth century some mathematicians hoped that 
mathematics could be build on an even more solid ground by reducing it to 
logic. The Norwegian logician Thoralf Skolem demonstrated the limitations of 
this program. 


Theorem 1.2.2 ( Skolem) 

Let L be the first order language of number theory with constants for ‘zero’ and 
‘one’, function symbols for ‘addition’, ‘multiplication’ and ‘exponentiation’ and 
a relation symbol for the ordering. Let 


MN = (N,0,1,+,-, exp, <) 


be the standard model. 
Then there is another L-structure W that is elementary equivalent to Nt, but 
not isomorphic to Nt. 


Proof 

This is a consequence of the compactness theorem. Let c be a new name, let T 
be the set of sentences in L valid in Nt, and let T* be T extended by the axioms 
kn <c for each n, where k,, is the numeral in L denoting the number n. 

Each finite subtheory of T* may use 9t as a model interpreting c as a natural 
number (depending on T*). Then by the compactness theorem, there is a model 
MM’ for T*. This model will be elementary equivalent to St because both are 
models for T,, but they are clearly not isomorphic. 


Remark 1.2.3 The completeness theorem and the compactness theorem were 
not available to Skolem. His argument used another form of model theoretical 
construction, something we might call a reduced ultra-product construction, see 
Exercise 5.3. 


By a similar argument we may show that the mathematically important concept 
of a well ordering is not definable in first order logic. 


Definition 1.2.4 Let (X,<) be a total ordering. (X,<) is a well ordering if 
every nonempty subset Y C X has a least element. 


Theorem 1.2.5 There is no first order theory T with the class of well orderings 
as its models. 


Proof 
The language of any such theory T’ must be the one with just one binary relation 
symbol, we may as well denote it by <. A sentence ¢ in L will be a theorem in 
T if and only if ¢ is true for all well orderings. So let T’ be the theory with all 
sentences ¢ true for all well orderings as its non-logical axioms. We will show 
that T has a model that is not a well ordering. 

Extend L with new names c; for each natural number k and let T* be T 
extended with all axioms cz41 < cy, fork EN. 
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Each finite subtheory To of T* will have a model, since there are arbitrarily long 
finite well orderings. By the compactness theorem, T* has a model 


A= (A, <7 {cP }nen)- 


Clearly the set {c? ; k € N} has no least element, so (A,<™) is not a well 
ordering. 
However 


(Av<") ET, 
so this structure has all the first order properties shared by all well orderings. 


A concept of importance in logic in general and in model theory in particular 
is completeness. 


Definition 1.2.6 A first order theory T over a language L is complete if T is 
consistent and T+ ¢ or T | 7¢ for each sentence ¢ in L. 


The following is a direct consequence of the completeness theorem: 


Lemma 1.2.7 Let T be a first order theory over a language L. The following 
are equivalent: 


1. T is complete. 
2. X=, B for all models A and B for T. 


If L is a first order language, 2 and % are L-structures, and 7 is an embed- 
ding from 2 to B, we may view B as a structure for L(2) using the interpreta- 
tion 

(ca) = (a). 


An embedding of 8 into 2 is a 1-1 map a that preserves the basic algebra 
of %. We will also be interested in maps that preserve the first order logic of 2: 


Definition 1.2.8 Let L be a first order language, and let 2% and B be two 
L-structures. 


a) Assume that 2 is a substructure of B. We say that 2 is an elementary 
substructure of B if 2=rq) B. 


b) Assume that 7 is an embedding from 2 to S and view % as a structure 
for L(2l) as above. 
We call 7 an elementary embedding if 2=rq) B. 


We used the diagram D(2) to axiomatize that 2 essentially is a substructure. 
Likewise, we will need a formal theory that axiomatizes that 2 essentially is an 
elementary substructure. 
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Definition 1.2.9 Let L be a first order language, 2 an L-structure. The theory 
of A, Th(2), will have the sentences in L(2) that are true in 2 as the non-logical 
axioms. 


We make the following observation: 


Lemma 1.2.10 Let L be a first order language, 2X and B two L-structures. 
The following are equivalent: 


1. There is an elementary embedding x from A to B. 


2. There is an interpretation of each new constant in L(Q) as an element in 
B such that B becomes a model for Th(2). 


Proof 
Both directions are based on establishing the equality 


m(a) = (cq)®. 

1. => 2. 

Assuming 1., we interpret ca in B by (ca)™ = 7(a). Since 7 is elementary, we 
have that 2% =p(q) 8, and thus B will be a model for Th(2). 

2.=> 1. 

Assuming 2., we define m by 1(a) = (ca)”. 

Let @ be a sentence in L(2). Since B — Th(2) we must have that 


AE Gs BEY. 
UE Ad + BE +6, 


It follows that 7 is an embedding and that 7 is elementary. 


Theorem 1.2.11 Let L be a first order theory, A and B two L-structures. 
Then the following are equivalent: 


L A=rp 8. 


2. There is an L-structure € with elementary embeddings m : 2 — € and 


7:B-€. 


Proof 

2. => 1. is trivial, so assume that 21 and S are elementary equivalent. We will 
extend L by adding names c, for each a € A and a disjoint set of names dy for 
bE B. 

Let T = Th(A) UTh(B). We will prove that T is consistent. 


Assume not. We will use that the set of axioms in Th(2) and the set of axioms 
in Th(B) are closed under A. 
Then there are sentences ¢ in Th(2) and w in Th(B) such that + -=(¢ A y). 
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Let 11,.--,2n;Y1;-++;Ym be fresh and distinct variables and let $’ and 7’ be 
L-formulas such that ¢ is of the form 


CD nena 
and w is of the form 

(Werte 
By the theorem of constants 

FAgAy’), 


and using propositional logic and rules for quantifiers 


b Aay-:- Send! — Vy: Vym-". 


Now 321 --+d2n¢’ will hold in 2 since a,,...,a@, makes ¢’ true. Since 2% and B 
are elementary equivalent, 4xz,---da,¢' will be true in B as well. 

As a consequence we must have that Vy1---Vym-7w’ is true in B. But this is 
impossible, since there is an instance that makes w’ true in B, namely w. Thus 
we have observed a contradiction. 


The assumption was that T is not consistent, and our conclusion will be that T 
is consistent and has a model €. By Lemma 1.2.10 we see that 2. holds. This 
ends the proof of the theorem. 


In Exercise 1.6 we give an application of this theorem. 


In section 1.1 we discussed directed limits of directed systems. Such systems 
where all embeddings are elementary are of a particular interest: 


Definition 1.2.12 Let L be a first order language, (I, <) be a directed set and 
let ({Qihier, {Tij hic) be a directed system of L-structures. 
We call the system elementary if each 7; is an elementary embedding. 


The limit embeddings of an elementary system will be elementary, see Exercise 
1.7. 


We will prove two important theorems about elementary extensions, the so 
called Lowenheim-Skolem theorems. The theorems say that there is no way 
first order logic can distinguish between infinities that dominate the size of the 
language in question. One consequence will be the Skolem’s paradoz, 


there is an elementary countable substructure of the reals. 


This is not a real paradox, it only demonstrates that first order logic is not 
adequate for proving, or even stating, that R is uncountable. 


Theorem 1.2.13 Let 2 be an infinite first order structure over the language 
L. Let X be any set. 

Then there is an elementary extension B of A and an injective map 6: X — B. 
(We can find elementary extensions of % of arbitrarily large cardinality.) 


15 


Proof 

Let L* be L(2l) extended with a name dy for each element b € X. Let T be the 
theory Th(2l) extended with the axioms d, 4 d. whenever b and ¢ are distinct 
elements of X. 

Since 2 is infinite, 2( can be viewed as a model for each finite subtheory of T*, 
so T* has a model %. Using Exercise 1.4 adjusted to elementary embeddings, 
we see that we may let B be an extension of A. Let 6(b) = (dy)®. 5 will be 
injective, and the theorem is proved. 


This was the easy upwards Lowenheim-Skolem theorem. We will now face the 
downwards Lowenheim-Skolem theorem. Since we will not bother to get involved 
in too much cardinality arithmetics, we restrict ourselves to the countable case, 
which is the most important and best known case. 


Theorem 1.2.14 Let L be a countable first order language, and let 2 be an 
L-structure. 
Then X has a countable elementary substructure. 


Proof 
Let ¢@ be a formula in L, a,,...,@, elements of A. In order to improve the 
readability, we will let 
$(@1,..-,2n) 
mean that all free variables in ¢ are among 71,...,2%n, and we will write 
(a1,...,An) 
instead of 
Die oy 
Cay se+1Can* 
Let O(y,21,.--,;%n) be a formula in L. A Skolem function for ¢ is a function 
h: A” — A such that whenever aj,...,@n are in A then 
@(A(a1,..., Gn), @1,---,@n) is true in 2 
whenever 


there is some 6 € A such that $(b,a1,...,@,) is true in Q. 


For each such formula ¢, we select one Skolem function hg. Ifn = 0, hg will 
be a function of no variables, which will be just an element of A. In particular, 
hy=y € A. (We need the axiom of choice to justify the existence of Skolem 
functions and to select one for each formula ¢.) 

If f is a function symbol in L, f* will be one of the Skolem functions: Let ¢ be 
the formula y = f(a1,...,2%n). Then f™ = hg 

Since L is countable, the set of Skolem functions will be countable. 

Let Bo be the empty set, and by recursion, let 


Brii = {h(a1,...,an) ; h is one of the Skolem functions hg and a1,...,an € 
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By}. 


By induction on k we see that each By, will be countable, and that By, C By+1. 
Moreover, By; 4 since hy=y € By. 

Let B be the union of the B,’s. Then B is a countable subset of A, and by 
construction, B is closed under all the Skolem functions. By Lemma 1.1.3, B 
is the domain of a substructure 8 of 2. It remains to prove that B is an 
elementary substructure of 2, which amounts to prove that whenever aj,...,@n 
are in B, then ¢(a1,...@,) is true in B if and only if it is true in 2. 

This is proved by induction on the complexity of ¢: 

If ¢ is atomic, the equivalence follows from the fact that 8 is a substructure of 
2. 

If ¢6=-w or 6= 1 V we, the induction step is trivial. 

If O(a1,..-,G@n) = dyw(y, a1,...an), and 6(a1,...,@n) is true in B via some 8, 
then, by the induction hypothesis, it will be true in 2 via the same b. 

If on the other hand $(a1,...,@,) is true in 2 via some }, it is true in 2 via 
hy (a1, -.+5Gn): 

Since hy(a1,...,@n) € B, we may use the induction hypothesis and see that 
P(a1,...,@n) is true in B via hy(a1,..., an). 

This ends the proof. 


Readers familiar with cardinal arithmetics will observe that the same argu- 
ment can be used to prove the general version: 


Theorem 1.2.15 Let « be an infinite cardinal number. Let L be a first order 
language of cardinality < Kk, and let A be an L-structure. 
Then A has an elementary substructure B of cardinality at most k. 


1.3. Complete Theories 


In the previous section, we defined a consistent first order theory to be complete 
if all sentences of the language can be proved or disproved. In this section we 
will look for criteria for a theory to be complete. 


1.3.1 Categorical theories 


A first order theory is categorical if all models are isomorphic. If a theory is 
categorical, it is a consequence of the upwards Léwenheim-Skolem theorem that 
all models are finite. Thus this concept is of a rather limited interest. The 
following is more interesting: 


Definition 1.3.1 Let L be a countable first order language, and T a theory 
over L. T is w-categorical if all countable models are isomorphic. 


It is easy the see that a consistent w-categorical theory must be complete, see 
Exercise 1.8. In Exercise 1.12 we see that the converse is not true. 
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Example 1.3.2 Let DO be the first order theory over the language L with 
equality and one binary relation symbol <, and with the following axioms: 


DO-1 -=(a < x) 
DO-2 t<yAy<z2>4K<z 


DO-3 t<yVy<a4Var=y 


DO-4 «<y— Judviw(u<t@An<vAv<yAy<w) 


The first three axioms tells us that < is a total ordering, while DO-4 implies 
that there is no largest element, no least elements, and that the elements in the 
ordering are densely ordered. DO stands for dense ordering. 

We claim that DO is w-categorical. We give an outline of the proof, and leave 
the details as Exercise 1.9 

Let (A,<,4) and (B,<g) be two countable models for DO. A finite partial 
isomorphism will be an order preserving map p: K — B where K C A is finite. 
The set of finite partial isomorphisms have the following extension properties: 


e If p is a finite partial isomorphism and a € A, then p can be extended to 
a finite partial isomorphism q defined on a. 


e If p is a finite partial isomorphism and b € B, then p can be extended to 
a finite partial isomorphism q with 6 in its range. 


Given enumerations A = {a, ; n € N} and B = {b, ; n € N} we can con- 
struct an increasing sequence of finite partial ismorphisms {p,}nen securing 
that pon(a,) is defined and that 6, is in the range of pon4i1. The limit of these 
finite partial isomorphisms will be an isomorphism between the two structures. 


We will discuss w-categoricity in more depth in the paragraph on element types 
and in the chapter on finite model theory. The method of proof is for obvious 
reasons called a back-and-forth construction. Sometimes in the literature, it is 
called a zig-zag-construction. 


1.3.2 Elimination of Quantifiers 


One of the key success stories of model theory is that of proving completeness 
of a theory via elimination of quantifiers, and thereby proving theorems about 
algebraic theories of genuine interest. Our key example will be field theory, but 
the main application is the theory of real closed fields. We have used traditional 
texts on model theory as sources for our exposition in this and later sections on 
model theory, in particular Sacks [2]. We will assume that the reader is familiar 
with field theory, but give a brief introduction for the sake of completeness. 
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Field Theory 


The language of field theory that we will use, will consist of three constants, 0, 
1 and —1 and two binary function symbols + and -. As logicians, we should 
make a clear distinction between these symbols and the symbols we use for the 
particular interpretations, but unless forced by the circumstances, we will not 
do so. 

Our axioms for field theory, F’, will be: 


F Ol (c+y)+z2=2+4(y+z) 


F 02 2¢+0=2 

F 03 «+(-1-2)=0 
F04e¢+y=yt+2 

F 05 (4@-y)-z=«a: (y-z) 
F 06 7-l=2 

F 07 «40 — dy(a-y =1) 


F 08 c-y=y-2 
F 09 «-(y+z)=a-yH+u-z 
F 10041 


We will use the standard algebraic notation like e.g. ry and —x for x- y and 
(—1)-y resp. 
A field will be a model for this theory. Each natural number n will have an 
interpretation in a field ¥. If § is a field and p is a prime, we say that the field 
have characteristic p if 
FEp=0. 

Zy = {0,...,p— 1} where addition and multiplication are carried out modulo 
p (where p is a prime) is an example of a field of characteristic p. If the reader 
is not familiar with this fact she/he should verify it by her/himself. A field 
will be of characteristic 0 if it is not of characteristic p for any prime p (a non- 
prime cannot be the characteristic of a field). The distinction between finite 
and infinite characteristics cannot be made in first order logic, neither can the 
distinction between finite and infinite fields. See Exercise 1.10 for more precise 
statements about this. 

Other familiar examples of fields are Q, R and C with the standard algebraic 
structures. All these fields will have characteristic 0. 


Definition 1.3.3 A field § is algebraically closed if each polynomial P of one 
variable and degree > 0 will have a root. This can be expressed by the infinite 
set of axioms 


Yn £0 Ax(ynz” + Yn—-1e" | +++ + yo = 0), 
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where x” has its usual mathematical meaning, and n > 1. 
We let ACF denote the theory of algebraically closed fields, and AC'F(p) or 
ACF (0) denote the extension where the characteristic of the field is specified. 


Lemma 1.3.4 An algebraically closed field is never finite. 


Proof 
Let a1,...,@n be elements of an algebraically closed field, and let 


P(x) = (a —a1)++-(@—a,) — 1. 
If P(x) =0, then « A ay,..., Gn. 


We will prove that ACF (0) and AC F'(p) for each p are complete, combining 
methods from logic and basic algebra. In proving this, we will isolate concepts 
of a more general interest, and we will state and prove general lemmas and 
theorems that will have the completeness of these theories as consequences. At 
the end, we will introduce the theory of Real Closed Fields, and use our method 
to prove completeness of this theory as well. 


The isomorphism property 


Definition 1.3.5 Let T be a first order theory. 

We say that T has the isomorphism property if whenever 2% and 8 are models 
for T,, %9 and Bo are substructures of 2 and B resp. and 7 is an isomorphism 
from 29 to Bo, then 7 can be extended to an isomorphism between submodels 
2, and B, of A and B. 


Example 1.3.6 The theory DO has the isomorphism property: 


Let (A,<) be an ordering. A gap in A will be an empty open interval (a,b), 
(—,a) or (b,—) induced by a pair a < b with no c between them, a minimal a 
or a maximal b. The gap structure reflects all instances of failure of the DO- 
axioms. 

If we have two dense orderings and two isomorphic suborderings, either these 
suborderings are themselves dense, or they will have isomorphic gap structures. 
Using the proof of the w-categoricity of DO we see that each pair of gaps can 
be filled by copies of Q taken from the given dense orderings. The result will 
be isomorphic submodels. We leave the details for the reader. 


Example 1.3.7 The theory of fields has the isomorphism property: 


A substructure of a field will contain 0, 1 and —1, and will be closed under 
summation and multiplication, so a substructure is actually a subring. Thus the 
task is to show that isomorphic subrings can be extended to isomorphic subfields 
inside the given fields. This is basic algebra, we have isomorphic ” quotient- 
structures” of the two rings, and interpreting each quotient in the fields will 
give isomorphic subfields. 


We also use standard algebra to prove 
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Lemma 1.3.8 The theory ACF will have the isomorphism property 


Proof 

Since this compendium is not a textbook in algebra, we will not give all the 
details. 

Given a field ¥ and an irreducible polynomial P(«) over that field we may 
extend the field by a root of that polynomial in a purely algebraic way, by 
considering the field of all polynomials Q(«) modulo P(x). If ¥o is a subfield 
of an algebraically closed field ¥, and a € ¥ is a root of P, then the map 
Q(x) + Q(a) is an isomorphism between the formal extension and a larger 
subfield of ¥. (Euclid’s algorithm for the largest common divisor applied to 
polynomial division is central here). 

The algebraic closure of a field can be described as the result of a transfinite 
sequence of such formal extensions, so the algebraic closures of isomorphic fields 
within algebraically closed fields will be isomorphic. 


In Exercise 1.11 we will give an alternative proof. 


Lemma 1.3.9 Let T be a first order theory over a language L and assume that 
T has the isomorphism property. 

Let T’ be obtained from T by adding new constants to L, but no new non-logical 
axioms. Then T’ has the isomorphism property. 


The proof is trivial and is left for the reader. 


The submodel property 


Definition 1.3.10 Let ¢ be a formula in a first order language L. We call ¢ 
simple if @ is of the form Jzrw where y is open. 


Definition 1.3.11 Let LZ bea first order language, T a theory over the language 
L. 

We say that T has the submodel property if whenever % is a model of T,, 2 is 
a submodel of 8 and ¢ is a simple sentence in L(2), then ¢ is true in 2 if and 
only if it is true in B. 


Example 1.3.12 The theory DO has the submodel property. 


Dropping formalities, let ¢ be the sentence Jxy(x,a1,...,dn). w can only ex- 
press that x equals some a; or that it is larger than some a;’s and smaller than 
others. If one such can be found in some dense ordering extending {a1,..., an} 
then it can be found in all such dense orderings. 


Example 1.3.13 Field theory does not have the submodel property 


Consider the statement da(x- a = 2). This sentence is not true for the field of 
rationals, but for the field of reals. 


Lemma 1.3.14 The theory of algebraically closed fields has the submodel prop- 
erty. 
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Proof 

Let 8 be an algebraically closed field, and let 2% be an algebraically closed 
subfield. Let ¢ = dxw be a simple sentence in L(2l). Then ~ is a Boolean 
combination of atomic formulas €(x,a1,...,@n), where a1,...,@, are in Al. 
The function symbols are denoting plus and times, so each term t(az) in L(2l) 
will be equivalent to a polynomial in the variable x with coefficients from 2. 
Since = is the only predicate, each atomic formula €(x, a1,...,@,) is equivalent 
to a polynomial equation 


P:(x) =0 


where Pz has coefficients from 2. Pe will have a root in 2 if and only if it has 
a root in B. 

If ¢ is true in 2, then ¢ is trivially true in 8. On the other hand, assume that 
@ is true in B via b, ie. w(b,a1,...,@n) is true in B. If one of the atomic 
subformulas € are true for 2 = 6, then b is the root of the polynomial P; with 
coefficients from 2l, so b is in 2. On the other hand, if none of the subformulas 
€ of ~ are true for x = b, we use that 2% is infinite (see Lemma 1.3.4), and that 
a finite set of polynomials only will have a finite set of roots. Thus let x = a, 
where a is in 2% and a is not making any of the subformulas € of w true. Then 
a and b satisfy the same atomic subformulas of ~, so W(a,a1,...,@n) must be 
true. This means that ¢ is true in 2. 


Lemma 1.3.15 Let L be a first order language and let T be a theory over L 
that has the submodel property. Let T’ be obtained from T by adding new names 
to L but no new non-logical axioms. 
Then T’ has the submodel property. 


The proof is easy and is left for the reader. 


Elimination of quantifiers 


Definition 1.3.16 Let L be a first order language, T a theory over L. 
We say that T accepts elimination of quantifiers if we for each formula ¢ have 
an open formula ~ such that 

TEé@eyw. 


Lemma 1.3.17 Let L be a first order language and T a theory over L. Then 
T accepts elimination of quantifiers if and only if each simple formula in L is 
equivalent in T to an open formula. 


Proof 

The ‘only if’-direction is obvious, so assume that each simple formula in L will 
be equivalent in T to an open formula. 

By induction on the complexity of a formula ¢, we prove that it is equivalent 
in T to an open formula, where we use the assumption to handle the quantifier 
case. 
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Lemma 1.3.18 Let L be a first order language, T a theory over L. 

Let @ be a sentence in L, and assume that whenever A and B are models for T 
satisfying the same variable free sentences, then @ has the same truth values in 
A and in B. 


Then @ is equivalent in T to a variable free sentence in L. 


Note 

If a formula ¢ is variable free, it contains neither bound nor free variables, 
and thus no quantifiers. Another way of describing such formulas are as open 
sentences. 


Proof 

Let To be T extended with all variable free theorems in TU {9}. It is sufficient 
to show that ¢ is a theorem in To. 

Assume not, and let 2 be a model for Ty U {79}. 

Let A be the set of variable free sentences true in 2, and let & be a model for 
T UA. Then the same variable free sentences are true in % and in B (namely 
A), so @ will hold in 8 as well. 

8 was an arbitrary model for TU A, so by the completeness theorem, 


TUAE 27¢. 
Then there are £1,...,&) € A such that 


T, PF A(E, A+++ A&). 


Then =(& A--- A &,) is an axiom in Ty and 
AE Whi A+: A En). 


This contradicts that 2  €; for each 7. 
The assumption was that ¢ is not a theorem in Jo. This led to a contradiction, 
so we are through. 


Theorem 1.3.19 Let L be a first order language, T a theory over L that has 
both the isomorphism property and the submodel property. Then T accepts elim- 
ination of quantifiers. 


Proof 

By Lemma 1.3.17 it is sufficient to show that if dé = Jara(x,71,...,¢n) where 
w is open, then ¢ is equivalent in T to an open formula. We may without loss 
of generality assume that n > 1. Extend T to T’ by adding names e1,...,€n, 
but no new non-logical axioms. By Lemmas 1.3.9 and 1.3.15, T’ has the iso- 
morphism and submodel properties. 


We will show that Jxi(x,e1,...,en) is equivalent in T’ to a variable free sen- 
tence €(€1,...,€n). Having done this, we conclude, using the theorem of con- 
stants, that daw(a,71,...,%,) is equivalent in T to €(11,...,2n). 


By Lemma 1.3.18 it is sufficient to show that whenever 2 and B are two models 
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for T’ satisfying the same variable free formulas, then Jxy(2,e1,...,én) will 
have the same truth values in 2 and in 8. 

If 21 and B satisfy the same variable free formulas, the respective minimal sub- 
structures 2% and Bo consisting of all interpretations of closed terms will be 
isomorphic. By the isomorphism property for T’, this isomorphism can be ex- 
tended to an isomorphism between two submodels 2, and 8, of 2% and B resp. 
Since T” has the submodel property, we have that Jaq)(x,e1,...,en) will have 
the same truth values in 2 and 2; and in 8 and B,. Since 2, and By, are 
isomorphic, we are through. 


Prime Models 


Definition 1.3.20 Let L be a first order language with at least one constant 
symbol. Let T be a first order theory over L, and let 2 be an L-structure. 2 is 
called a prime model for T if 2 can be embedded into any model % for T. 


Note: A prime model for T is not necessarily a model for T. Actually we have 


Lemma 1.3.21 Let L be a first order language, T a theory over L and A a 
prime model for T. Then X is a prime model for each extension T’ of T over 


L. 


The (trivial) proof is left for the reader. 


Example 1.3.22 Let L be the language of DO extended with the constants c; 
forieN. 

Let DO* be the theory DO extended with the axioms ¢; < c¢j41. 

Then N with its usual ordering, and with 7 as the interpretation of c;, is a prime 
model for DOT. 


Examples 1.3.23 The theories ACF(0) and ACF(p) have prime models N 
and Z, respectively. These are also prime models for the theories of fields with 
a fixed characteristic. 


In Exercise 1.12 We will see that DO* is a complete theory that is not 
w-categorical. 


Lemma 1.3.24 Let T be a first order theory over a language L, and assume 
that L has at least one constant symbol. Then the following are equivalent: 


1. T has a prime model. 
2. Each variable free formula in L is decidable in T. 


Proof 

First assume 1. Let 2( be a prime model for JT. Then for each model 8 for 
T and each variable free ¢ (the constant symbol in L ensures that there are 
variable free formulas), the truth value of ¢ in 8 will be the same as in 2%. Thus 
the existence of a prime model ensures that each variable free formula either is 
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true in all models for TJ, and thus is a theorem, or is false in all models, and 
thus is disprovable in T. Thus they are all decidable in T, and 2. is proved. 
Now assume 2. For a model 2% of T, let 29 be the submodel consisting of all 
interpretations of closed terms ¢ in I. 

If & and B are two models for T, we define 7 : %y + Bo by 7(t*) = t®. Since 
all variable free formulas are decidable, 7 is well defined and an isomorphism. 
This shows that lo is a prime model for 2% whenever A — T. 


A completeness criterion 


Theorem 1.3.25 Let L be a first order language and let T be a theory over L 
such that 


1. T is consistent. 

2. The language L of T has at least one constant symbol. 
3. T has a prime model. 

4. T has the isomorphism property. 

5. T has the submodel property. 


Then T is complete. 


Proof 

By Theorem 1.3.19 we have that T accepts elimination of quantifiers. 

Let ¢ be a sentence in L and let ~ be open such that TF ¢ — yw. We may 
assume that ¢ is variable free, since otherwise we may replace it by ¢(c,...,¢), 
where c is a constant of the language. 

Since T has a prime model, w is decidable in T by Lemma 1.3.24, so @ is 
decidable in T', meaning that TF ¢ or T+ 7. 


Corollary 1.3.26 The theories ACF(0) and ACF (p) for prime numbers p are 


complete. 


1.3.3. Real Closed Fields 


One of the first success stories in model theory was the proof due to Tarski that 
the theory of Real Closed Fields permits elimination of quantifiers, and thus 
that it is complete. 


Definition 1.3.27 Let the theory OF of ordered fields be field theory extended 
with one binary relation <, the axioms for total orderings and the axioms 


r<syrorutrz<ytz. 


0<2@A0<yoO0< czy. 
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There are obvious models for OF’, see Exercise 1.16. 

Working with ordered fields we may use most of our intuition about subfields 
of R, though there will be ordered fields that are not subfields of R. Let us 
consider an example: 


Example 1.3.28 Let Q be the standard ordered field of rationals. Then each 
natural number n is the interpretation of some term 7 in the language. 

Let c be a new constant, let T = Th(Q) and let T* be T extended with the 
axioms 7 < c for each number n. 

By the compactness theorem, T* has a model Q*. This will be an ordered field, 
but certainly not a subfield of R. 


Lemma 1.3.29 OF /0 42-27 >0. 


Proof 

By the first new axiom it follows that « <0 — —a > 0. Since (—2x)? = «x? for 
any field, we may split the argument in the two cases x > 0 and x < 0 and use 
the second new axiom. 


We will use standard algebraic properties of ordered fields without proving them 
in detail from the axioms. The uneasy reader should fill in the details for 
her/himself. We will show a little more care about facts that are proved using 
methods from analysis. 

Recall from basic analysis that an ordering is complete if each bounded nonempty 
set has a least upper bound, 


Lemma 1.3.30 Each ordered field § can be extended to an ordered field 6 where 
the ordering is complete. 


Proof 
Let ¥ be an ordered field with domain F’. Let a < b be in F.. then 


b 
a-1<a< 9 <b<b41, 


so the ordering of F will be a model for DO. 
In a dense ordering X, a Dedekind cut will be a set Y C X such that 


Y is bounded in X. 
Y is an initial segment of X. 
Y has no largest element. 


The Dedekind cuts are ordered by inclusion. The original ordering may be 
embedded into the set of Dedekind cuts by 


mx) ={y; y <a}. 


If § is an ordered field, we may extend the algebra on § to an algebra on the 
set of Dedekind cuts as follows 
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1 X+X'={a-b; aE XAbdE X'S. 


2. If X and Y both contains positive elements, let 


X:-Y={zeF;dre Xdye V(x >0Ay>0Az<a-y)}. 


3. (—1)-X = {-a#; x is an upper bound for X, but not a least upper bound 
for X}. 


4. For other pairs X and Y of Dedekind cuts we compute X -Y using 2. and 
3. 


The set of Dedekind cuts with the induced algebra is called the topological 
completion of §. The topological completion will itself be a field, and ¥ will be 
isomorphic to a subfield of the topological completion. The details are left as 
Exercise 1.17. 


For an ordered field % we may define continuity of functions f : F — F by the 
standard ¢ — 6-definition. Then every polynomial P(x) will define a continuous 
function. The details are left as Exercise 1.18. 


Lemma 1.3.31 Let § be an ordered field, P(x) a polynomial over § of uneven 
degree 2n+1. Then P(x) has a root in the topological completion. 


Proof 

The proof of this theorem for the reals, using the intermediate value theorem, 
works without change for any topologically closed ordered field, and thus for 
the topological closure of . 


We also have 


Lemma 1.3.32 Let a > 0 in an ordered field. Then a has a square root in the 
topological completion. 


Proof 
Let C = {b; b<OV(0< DAD? <a}. 
This is a Dedekind cut corresponding to \/a. 


Definition 1.3.33 The theory RCF of Real Closed Fields is obtained from OF 
by adding the following axioms: 


dy(a@ > 0 y2 = 2). 


Yon+1 £0 Fa(yo + ye +--+ + yongi2?”*t = 0) for each number n. 


Since the topological completion of the topological completion of an ordered 
field ¥ is isomorphic to just one topological completion of §, we just have shown 
that the topological completion of an ordered field will be a real closed field. 
The algebraic completion will be the least real closed subfield of the topological 
completion containing the image of the original ¥ under the embedding. All 
the roots in the topological completion of a polynomial P(«) in ¥ will be in the 
algebraic completion. This is basic algebra. 

The isomorphism property for RCF is a consequence of the following 
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Lemma 1.3.34 Let § and © be ordered fields where § is a subfield of 6 and 6 
is real closed. 
Then the algebraic closure of § is isomorphic to a subfield of 6. 


Proof 

Let § and 6 be the topological completions of F and 6 resp. 

Since ¥ is a subfield of 6, there is an inherited embedding 7 : § > 6. 
& is isomorphic to a real closed subfield 6’ of 6 and § is isomorphic to a 
subfield %’ of §. Then 3%’ is a subfield of t~!(G), which is real closed, and by 
our construction of the algebraic completion, the algebraic completion of ¥ is 
isomorphic to a subfield of 7~1(6’) and thus to a subfield of 6. 


Lemma 1.3.35 RCF has the submodel property. 


Proof 

Let ¥ and © be two real closed fields, ¥ a subfield of 6. 

Let ¢(x) be a quantifier free formula in L[¥] and assume that 6 — Jzr¢(z). 
Each atomic formula ~(2) in L[¥] will be equivalent either to a polynomial 
equation P(x) = 0 or to a polynomial inequation P(x) > 0. Let y(a),..., x(x) 
be the atomic subformulas of ¢ and let P,(x),..., P,(x) be the corresponding 
polynomials. 

Let {a1,...,@n} be the finite set of roots of the polynomials P;(x),..., P(x) in 
increasing order. The complements in ¥ and in 6 will be corresponding finite 
unions of open intervals on which each of the polynomials P;(2) will have a 
constant signature ‘positive’ or ‘negative’ (if not P;(x) will have a root in the 
interval in the topological completion, and thus in the algebraic completion). 
Let b € |6| be such that 6 — ¢(b). If 6 is a root in one of the polynomials, 
b € |§| and we are through. If b is in one of the intervals, find a € |§| in 
the corresponding interval. Then a and b will make exactly the same atomic 
subformulas of ¢ true, so ¢(a) = ¢(b), and we are through. 

We may now conclude 


Theorem 1.3.36 The theory RCF is consistent, has a prime model and per- 
mits elimination of quantifiers. 
Consequently RCF is complete. 


1.4 Element types 


Throughout this section, we will let L be a fixed countable first order language, 
and we will let T be a complete theory over the language L. 


Definition 1.4.1 An n-type is a maximal set X of formulas ¢(21,...,2n) such 
that for all ¢1,...,o, in X we have that 


Tb Say +++ datn(o1 A+++ A op). 
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We will use the term complete for this property. Each consistent set of formulas 
with at most 21,...,% free can be extended to an n-type. 

An n-type will be closed under A, and for each formula ¢ with at most 71,...,2n 
free the maximality of X will ensure that ¢ € X or ng € X. 


Definition 1.4.2 Let X be an n-type and 2 a model for T. We say that 2 
realizes X if for some @1,...,@, in A, AE P(a1,...,an) for each dE X. 
If 2 does not realize X we say that 2 omits X. 


Example 1.4.3 Let Mt be the standard model for number theory and T = 
Th(9). Let X be a complete extension of {k, < x; n € N} where k,, is the 
numeral for n in the language. This is a 1-type that is omitted by the standard 
model. The construction of a nonstandard elementary extension of St used to 
prove Theorem 1.2.2 amounts to constructing a model realizing this type. 


The method used to prove Theorem 1.2.2 is quite general, and can be used to 
prove the following 


Theorem 1.4.4 Let X be an n-type. Then there is a model & for T realizing 
XxX. 


Proof 
The proof is left as Exercise 1.20. Add new names for the n variables and use 
the compactness theorem. 


Some n-types will be realized by every model of T. 
Definition 1.4.5 An n-type X is principal if for some ¢ € X we have that 
Thréd-w 


for all ~ € X. We call ¢ a generating formula. If X is not principal, we will use 
the term non-principal. 


Lemma 1.4.6 If X is a principal type, then X is realized in every model A of 
T. 


Proof 
Any interpretation of 21,...,% that satisfies the generating formula will realize 
the type. 


Lemma 1.4.7 Let n be fixed. 
Then the following are equivalent: 


1. There are infinitely many n-types. 


2. There is a non-principal n-type. 
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Proof 
First assume that there is a non-principal n-type X = {¢; ; 1 € N}. For each 
i © N there must be a 7; > 7 such that 


TH O1 A+ Abi > 0}, 


since otherwise the type will be principal. Thus, {¢1,...,¢:,7@;,} must be 
consistent, and there is an n-type X; extending this set. Clearly, if 7 > j;, then 
X, and X; are different, so there are infinitely many n-types. 

Now assume that all types are principal. Since L is countable, we will only be 
able to use countably many generating formulas, so the set of n-types is at most 
countable. 

With the aim of obtaining a contradiction, assume that this set is infinite. 
Let {Xi}ien be an enumeration of all the n-types. Let $; be a generating 
formula for X;. Then, if 1 # 7 we must have that —¢; € X;. In particular, if 
j > maxr{iz,...,in} we will have that =(¢;, A... A di,) € Xj. 

It follows that {=; ; 7 € N} is a consistent set (here we use the assumption 
that there are infinitely many principal n-types), and may thus be extended to 
an n-type different from all the X;’s. This contradicts the assumption that we 
have enumerated them all, and the lemma is proved. 


If 2 is a model for T and aj,...,@, are elements in A (possibly with repe- 
tition), then a1,...,@n realizes exactly one n-type, 


X = {b(41,...,%n); WE O(ai,...,an)}. 


This indicates that the set of types may give us information about the class of 
countable structures. One example of this is 


Theorem 1.4.8 Assume that all n-types are principal for all n. 
Then T is w-categorical. 


Proof 

We will elaborate on the proof of the w-categoricity for the theory DO. 

Let 2 and 8 be two countable models for T with domains A and B. Let 
A’ = {aj,...,@n} C A and B’ = {b,...,bn} C B be finite sets (without 
repetition) and let p(a;) = b;. We call p a partial isomorphism if {a1,...,an} 
and {b1,...,b,} realize the same n-type, we have that 


Claim 
If p is a partial isomorphism and a € A, then p can be extended to a partial 
isomorphism q defined on a. 


Let us first see how to prove the theorem from the claim. The statement is 
symmetrical in 2% and 8, so given b € B we may as well extend p to a q with 
b in its range. 2% and % will be elementary equivalent since Tis complete. We 
may consider the empty function as a partial isomorphism defined on the empty 
set. Using the extensions obtained from the claim, we then use the back-and- 
forth strategy and build up a sequence of partial isomorphisms ensuring that 
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each a € A will be in the domain of one of the partial isomorphisms, and each 
b € B will be in the range of one of the partial isomorphisms. In the end, we 
have constructed a total isomorphism. 


Proof of claim 
Let X be the n+ 1-type of {a1,...,@n,a}. Since we have assumed that all 
n + 1-types are principal, there is a generating formula $(71,...,%n,;%n41) in 
X. 
Since 2 | (a1,...,@n,a) we have that 

WE Aan+10(a1,---, Gn, Ln41). 
Since {a1,...,@,} and {bi,...,b,} realize the same n-type, we have that 


SB [= Atn4io(b1, fo 5,0 son; Ln41)- 


Choose 6 € B such that $(b1,...,bn,b) holds in B and let g(a) = b. 
Let w € X. Since 


TH O(41,.--,2n;En41) 9 W(41,.--,Ln,2n41) 


we see that ~(b1,...,0n, 6) will hold in B. This shows that X is the type realized 
by {bi,...,6n, 5}, and the claim is proved. 


As a consequence we see that if the number of n-types is finite for each n, 
then T is w-categorical. We will prove the converse to this, but in order to 
do so we need the theorem about omitting types. The proof of the omitting 
type theorem is based on the Henkin-style proof of the completeness theorem 
for countable, first order theories. In that proof we we first extend a theory T 
to a Henkin theory Ty by adding Henkin constants cy and Henkin axioms 


2d — d%,, 


and then extend Ty to a complete theory Tc. The model realizing the com- 
pleteness theorem is then the term model of Tg. In this argument there is an 
amount of freedom, since there may be many ways to construct Tc from Tr. 
This freedom will be used in the proof of the omitting type theorem. 


Theorem 1.4.9 Let X be a non-principal n-type. Then there is a model for T 
omitting X. 


Proof 
The idea of the proof is as follows: We extend the language L of T with Henkin 
constants ¢1, C2,. . . and we extend T with Henkin axioms. Then, following 


the proof of the completeness theorem, we must make a complete extension 
of this extended theory and then form the term model. The term model will 
consist of equivalence classes of closed terms, and in fact, via the Henkin con- 
stant for da(x = t), we see that each equivalence class will contain a Henkin 
constant. Thus we must ensure, during the completion process, that for any 
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collection c;,,...,¢,, of Henkin constants there is a formula ¢@ € X such that 
a(ci,,--+,Ci,) is added to the theory. 

In the proof of the completeness theorem we ad Henkin constants and Henkin 
axioms in waves. However, we prove that each wave is countable. Thus the set 
of new constants is countable, and we may just organize them in a list as above 
such that if c; is the Henkin constant of dr@ and ¢; occurs in ¢, then i < j. 
Recall that the Henkin axiom is 


Arg(x) > o(c;). 


For the sake of simplicity, we will assume that n = 1. 
For each Henkin constant c, we intend to find a formula ¢, € X such that 


TU {7¢1(c1),---, 7% (cK) } 


is consistent with all Henkin axioms. 

From Henkin’s proof of the completeness theorem we know that adding a Henkin 
axiom introducing a new Henkin constant will preserve consistency. Thus, in 
the process of verifying that $;(c;) is consistent with the extended theory so 
far and all the Henkin axioms, we only need to be concerned with the Henkin 
axioms involving c,..., Ck. 

The ‘construction’ is by recursion on k, so assume that 


T U{7¢1(c1),--.,7be-1(Cr-1)} 


is consistent with all the Henkin axioms. 

Let €(c1,...,cx) be the conjunction of all -¢,;(c;) for 7 < i and all Henkin 
axioms introducing c1,..., Cx. 

Then T, &(ci,..., cx) is consistent. 

With the aim of obtaining a contradiction, assume that T,€(c1,..., cx), 70(ck) 
is inconsistent for all ¢ € X. 

Then TF €(c1,...,cn) > O(c) for all dE X. 

By the theorem of constants 


TE &(y1,---,Yk—-1, 0) > O(a) 


and thus 


TE dyr- ++ dyp-1€ (yi, +--+ Yr-1, 2) > (2) 


for all 6 € X. Since X is complete, there are two possibilities 


1. Sy. - ++ Syn—1€ (yi... +5 Ye—1,v) © X 


2. ayy +++ Sye—1€ (ys. + Yr-1,%) © X. 


Case 1. contradicts that X is non-principal. 
In case 2., we let @ = 7dyi +++ dye—1€(y1,---, Ye—1, 2) and see that 


TF dy: + + dye—1€(y,- +. yr—1, 0) Tyr +: dyp—1€(y1,---, Ye-1, 2). 
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By propositional logic it follows that 


Tb 7d yy ++: dyn—1€(y1,- ++ Ye—-15 2) 


contradicting the induction hypothesis, i.e. that T,€(c1,...,c%) is consistent. 
The assumption was that T,€(c1,..., cx), 70(cx) is inconsistent for each ¢ € 
X. This led to a contradiction, so there is a ¢, € X such that 


T,&(c1, er Ck); 7Ok (Ck) 


is consistent. 

The choice of € was such that T,741(c1),...,7¢%(cx) will be consistent with all 
the Henkin axioms. 

Let T’ be T extended with all 4¢;(c,) and all Henkin axioms. T“” will be a 
consistent Henkin theory. Let T° be a completion of T’. Then the term model 
of T° will be a model for T omitting X. 


In Exercise 1.24 we will discuss why the assumption that n = 1 is a harmless 
one. 


Corollary 1.4.10 If there are infinitely many n-types for some n, then T is 
not w-categorical. 


Proof 

If there are infinitely many n-types, one of them must be non-principal. Let X 
be non-principal Then there is one model 2 realizing X and one model B 
omitting X. These models cannot be isomorphic. 


Suplementary material for the advanced reader 


This tiny section can only be read with complete understanding by a 
reader with a background from descriptive set theory and cardinality 
arithmetics. 


An n-type is a subset of the set of formulas $(21,...,%n), which is a countable 
set. Via e.g. a Gddel enumeration we may consider an n-type to be a subset 
of the natural numbers. The set of subsets of N forms a compact, metrizable 
topological space in a natural way, homeomorphic to the Cantor space. 

There are two requirements that have to be satisfied by an n-type. Given 


G1,--+,Oe EX 


we must have that T+ 3a1---dan(od1A---A@x), and X must be complete. The 
first requirement defines a Gs-set and the second a closed set. Thus the set of 
n-types is a G'5-set of sets of formulas. 

By a standard and classical result of descriptive set theory, any G'5-set is either 
finite, countable or has the cardinality of the continuum (this holds for a much 
more general class than the G5-sets). Thus, if there are uncountably many n- 
types for some n there will be as many isomorphism classes of countable models 
for T as there are reals, since & many countable models will at most realize w x k 
many n-types, and if K < 2” then w x K < 2”. 
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1.5 Saturated structures 


In a sense we may say that the more n-types a model 2 for T realize, the richer 
the model is. The most generous would have been if 2 realizes every n-type for 
every n. Of course, if there are uncountably many n-types for some n, this is 
impossible, a countable structure can realize at most countably many n-types. 


Definition 1.5.1 Let 2 be a model for T, Ap C A. 

Let L(Ao) be LZ extended with names for each a € Ap. By abuse of notation 
we will use a both as the name and for the object. Let T[Ao] be the complete 
theory of all L(Ao)-sentences that are true in 2. 


There is no reference to & in the notation ‘T[Ao]’. This will be harmless in the 
proof of Theorem 1.5.6 by the following observation: 


Lemma 1.5.2 Let 2 be an elementary substructure of B and let Ag C A be 
finite. 
Then T|Ao] will be the same if we view Ao as a subset of A or as a subset of B. 


Proof 
This Lemma is just a reformulation of the definition of elementary substructures. 


Definition 1.5.3 Let 2( be a model for T. We call 2 saturated if all 1-types in 
T[Ao] are realized in 2% whenever Apo is finite. 


Lemma 1.5.4 Assume that T has only countably many n-types for each n. Let 
AE T. Let Ag C A be finite. 
Then T[Ao] has only countably many 1-types. 


Proof 

Let Ap = {a1,...,@,} and let X be a 1-type in T[Ao]. 

Let X’ = {G(a1,...,%n,%) 3 b(a1,..., Gn, 2) € Xf. 

Clearly, X’ is a consistent set, so X’ can be extended to an n + 1-type. 

If X and Y are different 1-types in T[Ao], then for some ¢(x) in L(Ao), ¢ € 
X \7Ag@ €Y. It follows that X’ and Y’ cannot be extended to the same n + 1- 
type. Since there are at most countably many n + 1-types in T’, there are at 
most countably many 1-types in T[Ao]. 


Lemma 1.5.5 Let 2 be a model for T, Ap C A a finite set and X a 1-type in 
T[Ao]. Then there is an elementary extension of & realizing X. 


Proof 

If X is principal, 2 will realize X. So assume that X is not principal. Then 2 
must be infinite (why?). Extend L by constants c, for each element a € A, and 
extend T' to the complete theory Th(2). 

Let c be a new constant and extend Th() to T* by adding the axioms 


o(c) whenever ¢ € X. 
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c# Cq for alla € A. 


We then use the compactness theorem to see that T* has a model 8. B will be 
an elementary extension of 2 realizing X. 


Theorem 1.5.6 Assume that T has at most countably many n-types for each 
neN. 
Then T has a saturated model. 


Proof 

The final model will be the directed limit of an elementary directed system, and 
if the reader has not solved Exercise 1.7 so far it is about time to do so before 
any further reading of this proof. 

We will start with a countable model 29 with domain Ap. At stage k, assume 
that we have constructed a countable model 2%, with domain Az. Then we 
select a finite subset B, C A, and a l-type X;, in T[B,] and let 2,41 be an 
elementary extension of 2l;, realizing X;. Here we use Lemma 1.5.5. 

In order to obtain a saturated model in the end, we must organize the selection 
of By and Xx such that the following are satisfied: 


e Whenever B C Ay, is finite, there are infinitely many k’ > k such that 
B= By. 


e Whenever B C Ay is finite and X is a 1-type in T[B], then there is a 
k' > k such that B= Bry and X = Xp. 


This organization requires some standard techniques, and the reader acquainted 
with those may skip the rest of the proof. For the rest of the readers, we offer 
the details: 

When 2; is constructed, let {Ci,~}iew be an enumeration of all the finite subsets 
of A, (that are not subsets of A,_1 when k > 0; this is not essential). By Lemma 
1.5.2 T[C;,x] will be a fixed theory for the rest of the construction. 

For each i and k, let {Xj,:,2};en be an enumeration of all 1-types in T[C;,,]. 

If k’ > 1, we may write k’ in a unique way as 


ki! = 273'5*'m 


where m is not divisible by 2, 3 or 5. Then we let Byr = Cj, and Xp = Xj,i,x. 


Theorem 1.5.7 Let & and B be countable models for T where B is saturated. 
Then there is an elementary embedding from 2 to B. 
If both A and B are saturated, they are isomorphic. 


Proof 

We show how to construct an elementary embedding in the first case. The 
isomorphism in the second case can be constructed by a back-and-forth con- 
struction using our construction at each step. 

Let A and B be the domains of 2 and B. Let A = {a, ; k © N}, let 
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Ax = {a0,.--,@%—1} and let 7, : Ay — B be injective. 
We say that m» is a partial elementary embedding if for all sentences ¢ in L(Ax), 


AE Ge BE 


where $7* is the sentence in L(%) obtained from ¢ by replacing each name for 
an element a € A, by the name for 7,(a). Observe that for k = 0, 7% is the 
empty function, and then it will be a partial elementary embedding just because 
2 and B are elementary equivalent. 

Now let By = {mx(a) ; a € Ax}. 

Let X be the 1-type in T|A,] realized by az, and let X’ be the corresponding 
1-type in T[B,;] using the transformation +> 7". 

Since % is saturated, X’ is realized by some b € B. 

We extend 7, to t41 by ™e41(ax) = b where b € B realizes X’. Then 7,41 is 
a partial elementary embedding and the construction can go on. 

At the end, the union 7 of all the 7;’s will be an elementary embedding from 
Ato B. 


The number of countable models 


The possible cardinalities of the set of isomorphism classes of countable models 
of a complete countable theory is fully determined, the cardinalities can be 


1,3,4, ... No, Ni, 2%. 


That the number 2 is left out is not a typo, but is one of the more peculiar 
results in model theory, a result we will discuss further. 

The theory DO* has three different models. Elaborating on this kind of con- 
struction we may find theories where the number of models are any number # 2. 
The theory ACF'(0) has countably many models. 

There are examples of theories with continuumly many models. One example 
is DO!, an extension of DO defined as follows: 


e Let {ci,;}ien,jen be new constants. 
e Let c5 < cy; be an axiom if i <7’ or ifi =7 and j < j’. 


Then DO! is a complete theory because the restriction of DO! to a finite set of 
constants will be w-saturated. 

DO! will have continuumly many non-isomorphic countable models. Each model 
2 will determine a subset Xq of N by n € Xy if there is an element between all 
the pj’ and Cn+41,9- All sets can be realized this way, and isomorphic structures 
clearly define the same set. 


In order to prove that other cardinalities are out of the question, we need to 
go beyond first order logic, and we need background theorems from descriptive 
set theory. This is outside the scope of this compendium. It is just recently 
established (2002) that X; is a possible alternative, even in the case that the 
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continuum hypothesis fails. The construction behind this argument is far be- 
yond our ambitions here. 

We do however have the tools available in order to show that there are never 
exactly 2 isomorphism classes of models. We give a sketch of the proof. The 
reader is invited to fill in the details in the (minor) gaps in the proof of Lemma 
1.5.9 and to prove Corollary 1.5.10 


Definition 1.5.8 A model 2 for T is weakly saturated if each n-type is realized 
in 2 for each n EN. 


Lemma 1.5.9 Assume that the number of models for T is finite, but > 1. Then 
T has a model that is weakly saturated, but not saturated. 


Sketch of proof 

T has a countable, saturated model 2. Let 24,...,2, be the other countable 
models. Assume that neither of these are weakly saturated, and for each i < k, 
let X; be an n,;-type not realized in 2;. Without loss of generality, we may 
assume that the variables used for the types X; are distinct, so X,U---UX x can 
be viewed as a set of formulas in the variables e.g. 71,...,@n where n = yey Nye 
Let X be an n-type extending X; U---UX;,. Then X cannot be realized in 2; 
for any i<k. 

Let c1,...,C€, be new names, and let T* be T extended with the axioms 
@(C1,---,€n) for O(a1,...%n) € X. Since Mt is saturated, we may interpret 
the constants c1,...,¢, in 2 in such a way that 2 becomes a model for JT”. 
Moreover, two such models will be isomorphic. Thus T* is w-categorical and 
there are finitely many n-types in T* for each n. It follows that there are only 
finitely many n-types in T for each n, which contradicts the assumption. 

This ends the sketch. 


Corollary 1.5.10 There is no complete countable first order theory with exactly 
two isomorphism classes of countable models. 


This corollary follows from the lemma and most of the results established ear- 
lier in this section, but the proof will not require any new technical arguments. 
There is no complete countable theory with exactly two non-isomorphic count- 
able models. 


1.6 w-logic 


The prime object of consideration in this chapter has been the model theory 
of first order logic. We have seen that for many purposes, first order logic is 
inadequate. Logicians have been studying other forms of logic where some of 
the tools from first order logic can be used. There are at least three directions to 
go, we may extend the use of quantifiers beyond the setting of first order logic, 
we may consider infinitary proof trees and we may consider infinite connectives. 
The first alternative is natural if we want to form logics adequate for mathe- 
matical analysis or e.g. for the study of Noetherian rings. The third alternative 
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was used when the possible numbers of models for a complete first order theory 
was characterized in full, something we chose not to do. 

In this section we will consider a form of infinitary logic for models where the 
natural numbers is a definable substructure. This actually requires infinite proof 
trees, since we cannot have the compactness theorem to be valid in this case. 


1.6.1 w-logic 


For the rest of this section, let L be a first order language with (among possibly 
other symbols) a unary predicate symbol 2, a constant 0 and a unary function 
symbol S. We will let T be a first order theory over the language L. 


Definition 1.6.1 Let 2 be a model for T. 
We call 2 an w-model if Q is interpreted as N, 0 as zero and S' as the successor- 
function. 


Definition 1.6.2 Let T be a theory as above. T is w-sound if T has an w- 
model. 
A formula ¢ in L is w-valid if @ is valid in all w-models for T. 


We may formulate a logical system adequate for this concept of validity: 


Definition 1.6.3 w-logic will be first order logic extended with four axioms 
and one infinitary rule: 


e Let ko be the constant 0, and let k,,41 be the term S(k,). 
e 0(0) and Q(z) — Q(S(zx)) are new axioms in w-logic. 
e 04 S(x) and S(x) = S(y) — # = y are new axioms in w-logic. 


e If gf is an w-theorem in T for all n € N, then V2(Q(x) — ¢) is an 
w-theorem in 7’. 


e We will write T -,, ¢ when ¢ is an w-theorem in T. 


Remark 1.6.4 We will not give a precise definition of an w-proof, because in 
order to do so we need to develop a theory of infinite, well founded trees. The 
important fact is that we have used recursion to define the class of w-theorems, 
and then we can prove lemmas and theorems about them using induction over 
this construction. 

In Exercise 5.14 we will make use of the concept of well founded trees introduced 
in Chapter 4 and give a precise definition of an w-proof. The present section 
will only be technically sound on the basis of this exercise. 


w-logic is of interest for several reasons. One reason is that the induction 
axiom in first order number theory is provable in w-logic. Assume that we have 
a proof of 


9(0) 
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and that we have a proof of 
Va(Q(x) A (2) > (S(a))). 


We then clearly have w-proofs for each $(kn) and by the w-rule we have a proof 
of 
Va(Q(2) > o(2)). 


In some odd respect, this is a simplification; the w-logic version of Peano arith- 
metic is better behaved from a proof-theoretical point of view, and analyzing 
the w-logic versions of proofs in number theory gives us information about the 
strength of number theory. This kind of application requires a much deeper 
involvement in proof theory than we are prepared for in this compendium. 


1.6.2 w-completeness 


We will prove the completeness theorem for w-logic. One motivation is to 
demonstrate the power of the proof method of the first order version. We will 
leave many of the details as exercises for the reader. 


Definition 1.6.5 T is w-consistent if there is no sentence ¢ A 7@ that is an 
w-theorem in T. 


We have the standard equivalent versions of the completeness theorem for w- 
logic: 


Lemma 1.6.6 The following are equivalent: 
1. A formula ¢ is valid in all w-models for T if and only if @ is an w-theorem 
in @. 
2. A theory T is w-consistent if and only if T has an w-model. 
The proof is left as Exercise 1.25. 


Lemma 1.6.7 The theorem of constants holds for w-logic. 


The proof is left as Exercise 1.26. 


Lemma 1.6.8 Let L be as above, T be an w-consistent theory over L and Ax 
be a sentence in L. Let r be a new name, not in L. 
Then T, Azo — 7 is also w-consistent. 


The proof is left as Exercise 1.27. 


Using this lemma, we may construct the Henkin-extension of an w-consistent 
theory such that all finite subtheories are w-consistent as well. We cannot 
conclude that the full Henkin extension is w-consistent, simply since 
w-consistency is not closed under directed unions of theories. We will see how 
we can avoid this obstacle. 
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From now on, T is a fixed w-consistent theory. Let ro, 71, . . . be the new 
constants that we need in order to construct the Henkin extension H of T, and 
let Loo be the extended language. Let H, be the subtheory of H where we only 
added the constants r9,...,7%n—1 to T and the corresponding Henkin axioms. 
Then r,, is not in the language of H,,. 

Let {¢n}nen be an enumeration of all sentences in L,, such that ¢,, is a sentence 
in the language of Hy. 


Lemma 1.6.9 Let S be an w-consistent theory, @ a sentence in the language 
of S. Then at least one of the theories S,@ and S,7@ are w-consistent. 


The proof is left as Exercise 1.28. 


Lemma 1.6.10 Let S be an w-consistent theory, t a closed term in the language 
of S and assume that S,Q(t) is w-consistent. 
Then there is a number n such that S,t = ky is w-consistent. 


Proof 

Assume that S,t = ky is inconsistent for all n. 
Then t #k,, is an w-theorem in S for all n. 
Using the w-rule we obtain 


Sty Va(Q(2z) = x At). 
This means that S F,, =Q(t), contradicting the assumption. This ends the proof 


of the lemma. 


We will now construct the theory T.. 

Let To = Ho =T. 

Assume that T,, is an w-consistent extension of H, within the language of H,,. 
By Lemma 1.6.8, TU H,,41 will be w-consistent. 

If T, U Hn+1, én is not w-consistent, we let 


Dye = Th U An+w1, 7dn.- 
If T, U An+1, On is w-consistent, but ¢, is not of the form Q(t), let 
Th41 = Th; Anat, dn: 


If Tn, O Hn41, On is w-consistent, and ¢, is of the form Q(t), we choose one m 
such that 


Th41 = Tn U Anat, t = km 
is w-consistent. 

Let T,. be the union of all T,.. We cannot prove directly that T,. is w- 

consistent, but T., is a complete Henkin theory in the traditional sense, so the 
term model 2 of T,, will be a model for T,, and then in particular for T. 
Q* will be the set of equivalence classes of closed terms t such that T,, - Q(t). 
For each such ¢, there will be a number n € N such that T,, | t = ky. Thus 
Q* will be isomorphic to St. This indirect argument shows that 2% is an w- 
model after all, and the completeness theorem for w-logic is proved. Thus, Ts, 
is actually w-consistent. 
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1.7 Exercises to Chapter 1 


Exercise 1.1 Show that if 7 is an isomorphism, then 7 has an inverse 77! that 
is also an isomorphism. 


Exercise 1.2 Show that if T is an open theory, 2 is a model for T and 7 : 
$8 — A is an embedding, then B is a model for T. (This is the easy part of 
Los-Tarski’s theorem.) 


Exercise 1.3 Let L be a first order language, 2% an L-structure and ¢ an L- 
formula with at most 71,...,2, free. 


a) 


Let s be an assignment over 2%. Then 


QL [481568 8H AU 
[3] = (ere) 
whenever ¢ is a term with at most 71,...,%n free. 


= | ABA ys: 
2 = os] = al - i a hia ae 
whenever ¢ is a formula with at most 71,..., 2p free. 
This shows that the definition of interpretations via assignments and via 
new names for all elements of the structure in question are equivalent. 


Let ¢ be a term with variables among 71,...,2,, and let u1,...,u, and 
T1,-++;Tn be two sequences of closed terms such that u? = r? for each 7. 
hen: (ie sae eee rae 

Let ¢ be a formula with free variables among 71,..., Up, and let u1,...,Un 
and r1,...,7n be two sequences of closed terms such that u3! = r?! for each 


a. 
Then ( Ti yen YA = ( Th seeeyTm A 


U1 yes Un Tizeeln 


Exercise 1.4 Let 2 and % be given. 
Show that the following are equivalent: 


1; 
2 
3. 


There is an embedding a from 2 to B. 
Q is isomorphic to a substructure A’ of B. 


Q is a substructure of some 8’ isomorphic to B. 


Exercise 1.5 Work out a complete proof of Theorem 1.1.14. In particular: 


a) 


b) 
c) 


Show that if both 2 and % satisfy the requirements of a directed limit, 
then they are isomorphic. 


Show that the definition of ~ is independent of the choice of k. 
Show that if C1,...,C, are equivalence classes, there is a common 7 € I 


such that each C; have an element of the form (i, a;). 


Al 


d) Show that the definitions of f% and R™ are independent of the choice of 
a. 


e) Show that 7 as constructed in the proof is an embedding satisfying the 
requirement of the theorem, and that this is the only one doing so. 


Then write down a full proof where a) - e) are incorporated in a natural way. 


Exercise 1.6 Let L be a first order language (with equality) and let T be a 
complete theory over L. 


a) Show that either is each model of T infinite or all models of T will have 
the same finite cardinality. 


b) Show that if T has a finite model, then all models are isomorphic. 
Hint: Use Theorem 1.2.11. 


Exercise 1.7 Let ({i}ier, {mij }icj) be an elementary directed system with 
directed limit (20, {7;}:er). 

Uniformly in i € J, show by induction on the complexity of a sentence @ in 
L(Q;) that 7; preserves the truth value of ¢. 


Exercise 1.8 Show that if T is a countable, w-categorical theory, then T is 
complete. 


Exercise 1.9 Prove in detail that DO is w-categorical 


Exercise 1.10 a) Show that for each prime number p there is a finite ex- 
tension F'(p) of field theory such that the models are exactly the fields of 
characteristic p. 


b) Show that there is an extension F'(0) of field theory such that the models 
are exactly the fields of characteristic 0 


c) Let @ be a sentence in field theory. Show that if ¢ is true in fields of 
arbitrarily large finite characteristic, then ¢ is valid in some field of char- 
acteristic 0. 


d) Let ¢ be a formula in field theory. Show that if ¢ is valid in finite fields 
of arbitrarily large cardinality, then ¢ is valid in some infinite field. 


e) Show that the theory of finite fields or the theory of fields of finite char- 
acteristics cannot be expressed in first order logic. 


f) Show that the theory F'(0) or the theory of infinite fields are not finitely 
axiomatizable. 


Exercise 1.11 (For readers familiar with Zorn’s Lemma). 
Let 2 and % be algebraically closed fields and let lo and So be isomorphic 
substructures via the isomorphism 70. 
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a) Use Zorn’s lemma to show that 7 can be extended to a maximal isomor- 
phism 71 between substructures 2, and Bj. 


b) Show that 2; and By are algebraically closed subfields of 2 and B. 


Exercise 1.12 Let DO*™ be the theory in Example 1.3.22. 


a) The theory DO* has three countable, non-isomorphic models. Find ex- 
amples of these, and show that there are no more than three isomorphism 
classes of countable models. 

Hint: When you have found tree non-isomorphic examples, you may use 
the proof of the w-categoricity of DO to prove that there are no more. 


b) Does DOt have the isomorphism property? 


c) Show that DO* is complete. 
Hint: Show that certain finite subtheories of DOT are w-categorical. 


Exercise 1.13 a) Let T be a first order theory with both the isomorphism 
and submodel properties. Let 2( be a model for T. 
Show that T(2l) = TUD(2l) has the isomorphism and submodel properties. 


b) A theory T is called model complete if T U D(2) is complete whenever 
AE T. 
Show that if T is consistent and has the isomorphism and submodel prop- 
erties, then T’ is model complete. 


Exercise 1.14 Let § be a field, Pi(a1,...,2,) and Q;(a1,...,2%) be polyno- 
mials over § fori <n and j <™m. 
Show that there is a solution to the simultaneous set of equations 


P,(t1,..., 2) =0 


and inequations 
Q; (21, stays 2k) # 0 
in some field extending ¥ if and only if there is a solution in the algebraic closure 


of $. 


(This result is known as Hilbert’s Nullstellensatz.) 


Exercise 1.15 A subset of C* is called algebraic if it is the set of solutions of 
a polynomial equation 
P(a1,...,0%) =0. 


Show that whenever A C C* is definable by a first order formula in the language 
of field theory using parameters from C, then A is a Boolean combination of 
algebraic sets. 

What does this say about the expressive power of the language of field theory, 
in e.g. relation to complex analysis? 
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Exercise 1.16 Show that Q, which is Q with the standard algebra and order- 
ing, is a model for OF. 
Show that Q is a prime model for OF. 


Exercise 1.17 Carry out the definition of multiplication in the topological 
completion for the remaining cases. 

Verify that the topological completion is indeed an ordered field, and that the 
original field ¥ may be embedded into its topological completion. 


Exercise 1.18 Show that each polynomial function over a real closed field is 
€ — 6-continuous. 


Exercise 1.19 Revisit Exercises 1.14 and 1.15. 
Reformulate the results proved there to theorems about Real Closed Fields, and 
prove them. 


Exercise 1.20 Prove Theorem 1.4.4 in detail. 


Exercise 1.21 Let T be the complete theory DO* considered in Exercise 1.12. 


a) Find a non-principal 1-type as the completion of some infinite set of for- 
mulas. 


b) Hard Describe all the 1-types and show that there is exactly one non- 
principal 1-type. 


c) Which of the three models of DOT is the saturated one, which is the one 
that is weakly saturated but not saturated and which is the one omitting 
the one non-principal 1-type? 


Exercise 1.22 Let DO! be the theory introduced on page 36. Show that there 
is a countable model 2 for DO! such that all other models can be embedded 
into 2. 

Why does it follow that DO! has only countably many n-types for every n? 


Exercise 1.23 let L be the first order language (with equality) with an enu- 
merable set {P;};en of unary predicate symbols. 
Let T’ be the theory over L with non-logical axioms 


Fa(Eo(a) A-+* A En—1(2)) 


where each €; is either P; or -P;. 


a) Show that for each &,...,€,—1 as above and each number k we can for- 
mulate and prove in T that there are at least k different objects satisfying 
fo A+++ A n-1- 


b) Let L, be L restricted to the predicate symbols Po,..., Px—1 and let T;, 
be the set of sentences in Ly that are theorems in T’. 
Show that Ty is w-categorical. 


44 


c) Show that T is complete. 


d) Show that T has uncountably many 1-types. 


Exercise 1.24 Fill in the details in the following sketch of a proof of the omit- 
ting type theorem for n-types in general: 


e We may enumerate all ordered n-sequences of Henkin constants in a list 
{(Ciks +++ €n,k) Seen: 


e By induction on k, we may find a formula ¢, € X such that 
T U {7g (C14, -++5 nb) } 
is consistent with all the Henkin constants. 
Exercise 1.25 Prove Lemma 1.6.6. 


Exercise 1.26 Prove the theorem of constants for w-logic. 

Warning: The proof of the theorem of constants in the first order case use a 
variable x that occurs nowhere in the proof. In this setting, we do not have 
proofs, only provability. Moreover, if we had proofs, we might risk that all 
variables are used in the proof. Discuss how we can overcome this obstacle. 


Exercise 1.27 Prove Lemma 1.6.8. 


Exercise 1.28 Prove Lemma 1.6.9. 
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Chapter 2 


Finitary Model Theory 


In Chapter 1 we have touched a little bit on the subject of model theory. To 
a large extent we have been interested in countable structures; implicitly we 
considered finite structures to be too simple and uncountable structures to be 
too advanced. The main reason why we did not go on with more model theory 
is that we need time and space for computability theory. 

In some respects, the finite structures are simple, but in other respects they are 
the only interesting mathematical structures there are. For instance, a database 
is normally a finite, but dynamic, structure. Designing logical systems that are 
adequate for database theory is a challenging task. 

In this chapter we will offer the reader a glimpse of a growing field of logic, 

finite model theory. We are going to prove one of the classical results, the 
so called 0-1-law. In a sense, this result tells us that first order logic without 
constants and function symbols have remarkably restricted expressive power, 
and that we need higher order logic to say something intelligent about a finite 
structure. 
This chapter will have three sections. In the first section we will prove the 0-1- 
law, in the second section we will extend the language to a 2. order language, 
and see how the expressive power increases. The final section will be the set of 
exercises. 


2.1 The O-1-law 


For the rest of this chapter, L will be a fixed first order language without 
constant symbols, without function symbols, and with finitely many predicate 
symbols. 


Definition 2.1.1 a) A complete open description will be an open formula 
M(a1,...,%n) that is a maximal consistent conjunction of literals in the 
variables 71,...,2%, displayed. 
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b) If M(a1,...,%,) and N(a,...,2n,Y1,---,Yx) are complete open descrip- 
tions, then N is an extension of M if=& N— M. 


Sometimes it is convenient to use an alternative notation for long conjunctions 
and disjunctions. In the literature, the notations like A\,-,¢; and \/;<; ¢: are 
sometimes used to form infinitary formulas where conjunction and disjunctions 
are taken over infinitely many subformulas. We will only use the notation when 
the abbreviations ¢;, \--- A @;, and ¢;, V--- V $j, are a bit cumbersome. We 
will also use the notation Z to mean the list of variables 71,...,7,, and 37 will 
be an abbreviation for 471 --- dry. 

When variables are displayed in a formula like M(#) it is understood that there 
will be no free variables that are not on display. 


With these conventions in place, we will let T’ be the theory over L where 
the non-logical axioms are all formulas of the form 


VE((J\ vi # 2) A M(2)) > y(f\\y Axi A N(Z,y))) 
ij i 


whenever M is an open complete description and N is an open complete de- 
scription extending M. 


Theorem 2.1.2 Let L and T be as above. Then T is consistent, has no finite 
models, and is w-categorical. 


Proof 

We first prove that T is consistent, by constructing a model 21. The idea of the 
proof is to ad one element at each step, and at each step make one instance of 
one of the axioms true in the final structure. 

Each step is simple, we take an object out of the blue and just make the instance 
under consideration true by brute force. It is the way we organize the steps in 
order to ensure that all instances from the final model of all axioms are taken 
care of that may seem complicated to the reader. Readers with experience from 
such arguments (similar to, but actually simpler than, our construction of a 
saturated model for a theory with only countably many element types) may 
skip parts of this argument. 

We will construct an increasing sequence {2,},en of structures with domain 
{O, e024}: 

Let Ag be the structure on {0} where R*°(0,...,0) holds for all relation symbols 
Rin L. 

Let {(Mi, Ni)}ien be an enumeration of all pairs of complete open descriptions 
M(#) and N(#,y) such that N extends M. 

Let n; be the arity of Zin Mj. 

For each i, let {@;,;};en be an enumeration of all non-repeating sequences of 
natural numbers where the sequence has length n;. We may assume that each 
component Gj; is bounded by 7. 

Then at step k = 2'(27 +1) we know that @;,; is a non-repeating sequence from 


AT 


the domain of 2,1, so M;(@;,;) is a legal instance, with truth value 1 or T. 
If M;(G;,:) is false, we extend %,_1 to A; by adding k to the domain and making 
all new instances of atomic statements true. 

If M,(G;,;) is true, we extend %,_; to A, by adding k to the domain and 
interpreting new instances of atomic formulas such that N(@;,,%) will hold. 
This will be possible since N is an extension of M. If N does not tell us how 
to interpret an instance of an atomic statement, let it be true. 

In this way we will ensure that all the axioms of T will be valid in the limit 
structure 21 on N, and we do have a model. Thus T is consistent. 


Now assume that % is a finite model of T with a non-repeating enumeration 
{bi,...,0n} of its domain. Let M(a1,...,2%,,) be the unique (up to equivalence) 
complete open description satisfied by b1,..., Bn. 

M will have an extension to a complete open description N(a1,...,%n,y)- 
Since 8 — T we have that 


BE ayy # bj A M(b1,---,ns¥)), 


which is impossible by the choice of b1,..., Bn. 

Finally we will show that T is w-categorical. If 2 and B are two countable mod- 
els for T, we will construct an isomorphism using a back and forth construction 
like the ones we have seen before. 

A partial isomorphism will, in this proof, be a map p : Ap — Bo where 
Ao © A, Bo C B, p is 1-1 and onto and such that whenever a1,...,@, are 
in Ap and R is an n-ary relation symbol, then R™(a,,...,@n) if and only if 
R® (p(a1),---yP(An))- 

Given a@1,...,@, there is a unique complete open description M(a,...,¢n) 
(unique up to equivalence) such that M(a1,...,@,) holds, and then we may 
rephrase the definition of a partial isomorphism to: 

If Ap = {a1,..-,@n}, Bo = {p(a1),...,p(an)} and M is the complete open 
description of {a1,...,@,} then M is also the complete open description of 
{p(ai),..-,;p(an)}. Let p and a ¢ Ao be given. Let M be the complete open 
description of Ap described above, and let N be the complete open description 
of Ap U {a}. The axiom for M and N just implies that p can be extended to a 
q defined on a. 

By symmetry we also have that if p: Ag — Bo is a partial isomorphism and 
b € B, then p can be extended to a partial isomorphism gq such that b is in the 
range of q. 

Then, as before, we can construct an isomorphism between 2 and % by piece- 
wise extensions. This ends the proof. 


We will now restrict our attention to L-structures 21, where the domain A is a 
set {0,...,2 —1}. Then the number of L-structures will be finite, let k(L, 7) 
denote this number. 


Definition 2.1.3 Let ¢ be a sentence in L. 
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a) Let k(¢, L,n) be the number of L-structures over {0,...,n—1} satisfying 
¢ and let the n-probability of ¢ be 
_ k(¢,L,n) 
Min(@) = “k(L,n) | 


b) Let the asymptotic probability of ¢ be 


Lx(¢) = lim pzn(?) 


n—oo 


provided the limit exists. 
We will drop the subscript L from now on. 
Lemma 2.1.4 If ¢ is an axiom of T, then u(d) = 1. 


Proof 
Let @ be the axiom 


v(f\ 4%, #2; \ M(Z) > ay(\\y # x; \ N(@,y))) 
ikj i 


where = 2j,...,2m.- 
Assume that m <n. We will estimate jun (=¢) from above, i.e. 


Qn = Hn (Say +++ Atm ( \ as # 2j A M(Z) AVy( Ay # ai > AN(Z,y)))). 
iAj i 

We will consider formulas where we have used elements from {1,...,n} as in- 

stances, without making too much fuss about how to do this strictly according 

to the book. The local probability of a sentence of this kind is defined in the 

obvious way. 

There are n(n — 1)---(m— m+ 1) ways of selecting distinct elements for 

L1,.+-;%m, and by symmetry they are equally probable. Thus we have that 


an = 


n(n—1) +++ (n—m+1)-fin(M(0,... sm—Davy(/\y #i—-N(0,...,m—1,y))) 


a 


which is bounded by 
Dn, =n" tin(M(0,...,m—1) AVy([\y #i > =N(0,...,m— 1,y))) 


which again is bounded by 


m-1 


Cn = 1 + pn (Vy( \ y #i—>-7N(0,...,m—1,y))). 
i=0 
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Let &,...,& be all the literals in N with an occurrence of the variable y. Then 


m—1 l 
Cn = 0 + pin (yl N y#ir V 7€;(0,...,m—1,y))). 
i=0 j=l 


For each k with m < k <n and j < 1, the probability of €;(0,...,m—1,k) is 
exactly 5. Thus we may evaluate 


n l 


cn =n™- TT tn(\/ 76)(0,-.-.m—1,9)) =n — 5 )™. 


i=m j=l 


Since m and | are fixed and (1 — 3) < 1, we have that 


This proves the theorem. 


Corollary 2.1.5 Let L and T be as above, ¢ a sentence in L. Then (od) = 1 
or iu(@) =. 


Proof 
Since T is w-categorical, we know in particular that T is complete. 
If T+ ¢, there are axioms ¢1,...,@% in T such that 


$1,---, Or F @. 


Since 


it follows that u(d) = 
(4) = 0 since (6) = 


By the same argument, if T + 7a@ we have that 


An application to graph theory 


In this section we will let a directed graph simply be a binary relation R on a 
set G. In order to get the feeling of a graph-theoretical environment, we will 
call the ordered pairs (a,b) for oriented edges with tail a and head b, and we call 
the elements of G for nodes. Graph theory also covers the case of non-oriented 
edges, the possibility of several edges between two nodes, no edges from a node 
to itself etc. For all these definitions of graphs, a similar phenomenon as the 
one we will discuss takes place. 


A path from a to b will be a finite sequence of edges €1,...,¢€n such that 
a is the head of e}. 


b is the tail of en. 
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For all i < n, the head of e; is the tail of e;41. 


A path like this will have length n. We say that the graph is connected if there 
is a path from a to b for all nodes a and b (this concept is more commonly 
used for undirected graphs, so do not use it in a context independent of this 
section). We will call a graph intimately connected (not a standard concept in 
any context) if all nodes a and b are connected via a path of length at most 2. 


Corollary 2.1.6 Let P(n) be the probability that an arbitrary graph on 


{0,...,n} is intimately connected. 
Then 

lim P(n)=1. 
Proof 


Let L be the language of graph theory, and let T be as above. Consider the 
sentence 


@ =VaVysz(R(a2, z) A R(z,y)). 
¢ will be a theorem in 7’, and thus have asymptotic probability 1. 


This example shows that most finite graphs are non-interesting. In an effort 
to make this corollary sound interesting, we claim that it demonstrates that one 
must show some care when describing the virtue of probabilistic algorithms. 
The probabilistic algorithm ‘Yes’ solves the problem ‘is G connected?’ with 
better and better probability the larger the finite graph G is, but it is of course 
absolutely worthless. 

In Exercise 2.2 we show that most transitive finite binary relations are rather 
non-interesting. 


2.2 Second order languages 


We have shown that the first order theory of algebraically closed fields of a fixed 
characteristic is complete. We have shown that the natural numbers cannot be 
characterized up to isomorphism by any first order theory. In this chapter we 
have shown the 0-1-law of finite model theory for first order formulas. 

These and other results indicate that first order languages are poor in expressive 
power. 

These languages are called first order because we only accept quantifiers over 
the domain of the structure. A standard mathematical format of the induction 
axiom for number theory is as follows: 


VA(0OE AAVe E N(x € A> a4+1€ A) > Vr E N(x € A)). 


Here we let A vary over all sets, in particular over all subsets of N. Quantifiers 
ranging over all subsets of a domain, or more generally, over all predicates of 
some fixed arity on a domain, will be called second order. We will be more 
precise in a while, but notice already now that the natural numbers can be 
described up to isomorphism by second order formulas, see Exercise 2.4. 
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One problem with interpreting a second order quantifier is that the interpre- 
tation is not absolute. By this we mean that the powerset of an infinite set is 
not fully understood. There are second order sentences in number theory where 
the truth value will depend on chosen axiomatisations of set theory. This will 
be discussed at more depth in a course on axiomatic set theory. If we restrict 
our attention to finite structures, the same argument against second order logic 
in general is not valid. Given a finite set we have, at least theoretically, a full 
control of the power set. 

We will show that the expressive power of second order quantifiers is such 
that the 0-1-law does not hold any more. A more systematical treatment of 
second order logic and other extensions of first order logic is beyond the scope 
of this text. 


Definition 2.2.1 a) A second order language L? will be a first order lan- 
guage L extended with variables X* for predicates of arity k. 


b) If t1,...,t, are terms in L, then X#(t,,...,t,) is a new atomic formula. 


c) The class of second order formulas is closed under boolean connectives 
and first order quantifiers in the same way as the first order formulas are. 


d) If # is a second order formula, then IX*¢ and VX*¢ are second order 
formulas. 


We will not go through the full definition of how second order formulas are 
interpreted over an L-structure. The point is that they can be interpreted, and 
that if 2 and % are isomorphic L-structures, then they will satisfy the same 
second order sentences. 

From now on we will restrict ourselves to the first order language Di of 
equality, and its second order extension L2.. Then there is exactly one structure 
with domain {0,...,n — 1} for each n. 


Theorem 2.2.2 There is a second order sentence that is true exactly for the 
finite structures with an even number of elements. 


Proof 

In the sentence below, X will be a predicate variable of arity 1 and Y will bea 
predicate variable of arity 2. 

Let us first look at the sentence, and discuss why it works afterwards: 


AX AY WasyY (x,y) A VysrY (a, y) 

MavyV2(Y (a, y) AY (a, z) > y = 2) 
MavyV2(Y (2, y) AY (z,y) — @ =z) 
MVay(Y (@,y) > (X(@) @ >X(y))))- 


The first and third conjuncts express that Y is the graph of a function fy. 
The second and fourth conjuncts express that fy is onto and one-to-one, i.e. 
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a bijection. The last conjunct expresses that fy is a bijection between X and 
the complement of X. If the domain of the structure is finite, we may find 
interpretations of X and Y satisfying this exactly when the total number of 
elements is an even one. 


Clearly the sentence above does not satisfy the 0-1-law, so we have proved 


Corollary 2.2.3 The 0-1-law is not satisfied in general by second order sen- 
tences. 


In the proof of Theorem 2.2.2 we saw that we can express that Y is the graph 
of a function and other properties of functions using first order expressions in 
the variable Y. Other properties that we may express is 


1. X? is finite. 
2. X? is infinite. 
3. (X1,Y?) is a well ordering 


4. A ring is Noetherian (which means that any decreasing sequence of ideals 
is finite) 


The verifications of the first three facts are left as Exercise 2.5. The proof of 
Fact 4. is easy if you know the definitions of Noetherian rings and ideals, but 
will be very hard if you do not know these definitions. 


2.3. Exercises to Chapter 2 


Exercise 2.1 Show that there is an upper bound on the length of a complete 


open description M(x1,...,%) depending on n and the signature of the lan- 
guage, and that whenever M(a1,...,%n) is a complete open description and 
Y1,---;Yk are new variables, then M has an extension to a complete open de- 


scription N(21,...,%n,Y1;--+;Ym): 


Exercise 2.2 In automata theory, in logic and also in other branches of math- 
ematics or theoretical computer science we sometimes define a relation by con- 
sidering the reflexive and transfinite closure of a one-step-relation (e.g. one-step 
action of a Turing machine or one-step deduction in a formal theory). The 
transitive closure of a binary relation cannot be axiomatized in first order logic 
(if you did not know this, consider it as a part of this exercise to prove it) and 
is defined by 


R* (a,b) @ An € N5ay,...,dn(a = a1 Ab =a, AVi <n Raj, a;41)). 


Let R be an arbitrary binary relation on {0,...,n}. Show that with asymptotic 
probability 1, R*(i,7) for all i,j <n. 
What are the asymptotic probabilities of the interesting properties 
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R is reflexive? 
R is nowhere reflexive? 
R is transitive? 
R is symmetric? 
Exercise 2.3 Let L be the language with = as the only symbol. 


a) Show that for every sentence ¢ in L and every n we have that pin() € 
{0, 1}. 


b) Show that the function n +> p,(¢) will be eventually constant for each 
sentence @. 


c) Evaluate from where the function n + p,(¢) is eventually constant, in 
terms of some number related to ¢, and use this to show that the pure 
theory of equality is decidable. 


Exercise 2.4 Find a set of first and second order sentences in the language of 
number theory that will have isomorphic copies of St as the only models. 


Exercise 2.5 A standard set-theoretical characterization of infinity is that a 
set A is infinite exactly when there is a bijection between A and a proper subset 
of A. Use this to show that we may express that a set is finite by a second order 
formula. 

Find a second order formula that express that Y? is a well ordering of X!. 

If you want to go on with harder problems, we suggest that you continue with 
Exercise 5.4. 
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Chapter 3 


Classical Computability 
Theory 


3.1 The foundation, Turing’s analysis 


In Leary [1] the recursive functions are defined as those that can be represented 
in elementary number theory. f : N* — N is recursive if there is a formula 
O(@1,..-,X%%,y) such that for all ny,...,n%,m we have that f(m1,...,n~) =m 
if and only if 

NE @(€ny3+++5 ngs ¥) CY = Cm: 


Here c, is the numeral for n, and N is elementary number theory. 

The advantage of this definition is that it is well suited for proving Gédel’s 
incompleteness theorem without introducing too many new concepts. The prob- 
lem is that there is absolutely no conceptual analysis of the notion of computabil- 
ity behind this definition. 

Gédel defines a class of recursive functions by recursion (pun intended). His 
aim is to define a sufficiently rich class for handling algorithms for e.g. substitu- 
tion of a term for a variable, and for coding the meta-theory of a formal theory, 
but sufficiently simple to enable us to show that any recursive function will be 
definable, and actually, representable as described above. Gédel’s computable 
functions are now known as the ji-recursive ones. 

We are going to base our study of computability on an approach due to 
Kleene, and we are going to restrict ourselves to computable functions defined 
on the natural numbers. In many respects, computing can be described as 
manipulation of symbols following a given set of rules. The symbols are not then 
natural numbers, and different ways of representing natural numbers (binary, 
decadic, via numerals, Roman figures etc.) might give different concepts of 
computing with numbers. 

The best mathematical model for computability and computations is due 
to Alan Turing. He defined what is now known as Turing machines, small 
finite state machines operating on an infinite one-dimensional tape and doing 
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symbol manipulations on this tape. The machine will, between each step of the 
computation, be in one of finitely many states. It will read one of the symbols 
on the tape, and dependent of the state it is in and the symbol on the tape that 
it reads, it will according to a fixed rule change its state, rewrite the symbol and 
move to the symbol to the right or to the left. It may of course stay in the same 
state, it may of course keep the symbol on the tape as it is, and sometimes it 
may be convenient to let it stay where it is. We think of the tape as consisting 
of squares, like an old fashioned movie-tape. 
A Turing machine M is determined by 


1. A finite alphabet © including one special symbol Bl for an empty square 
of the tape. 


2. A finite set K of states, with one special state s € K called the initial 
state. 


3. A partial function 6 : (X x K) — (Xx K x {L,S,R}), where L means 
“left”, R means ”right” and S means ”stay where you are”. 


In the literature you will find many variations in the definition of a Turing 
machine, but they all have the same computational power. In this exposition, 
we decided to let the function 6, which rules the action of the machine, be 
partial. If 6(0,p) is undefined, the machine will be in a halting situation, which 
means that the computation comes to an end. We are not going to give a precise 
definition of the operational semantics for Turing machines, but are content with 
an intuitive description: 


The operational semantics of Turing Machines 


Let M = (%, K,s,6) be a Turing machine. 

The starting configuration of M will consist of a word w (called the input) in 
written on an otherwise blank tape that is infinite in both directions, a position 
on the tape and the initial state s. 

At each step, the machine M may enter a new state, rewrite the symbol at its 
position on the tape and shift its position one square to the right or to the left, 
all according to the function 6. 

If M is in stage p and reads the symbol o, and d(0,p) is undefined, M halts. 
Then we who observe M will know this, and we will be able to read the content 
of the tape, which will be called the output. 

Normally there will be some conventions for how to organize the input con- 
figuration, e.g. that there should be no blanks in the input word and that the 
machine will be started at the first blank square to the left or to the right of 
the input word. Then the following makes sense 


Definition 3.1.1 Let “po be an alphabet not containing the symbol Bl 

Let 6 be the set of finite words over Xo. 

Let f : 44 — XG be a partial function. 

We say that f is Turing computable if there is a Turing machine M over an 
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alphabet © D No such that if M is started with w € &* on the tape, then it 
halts if and only if f(w) is defined, and then with f(w) as the output word. 


Turing claimed that a Turing machine can 

e Search systematically for pieces of information. 
e Remember pieces of information 

e Rewrite contents 


all according to fixed rules. As an example we may consider the map n +> n! and 
the map m+> m"™. We all agree that these maps are in principle computable, 
but try to think about how a computation of 10!°! could be carried out: We 
would need a lot of book-keeping devices in order to be at the top of the situation 
at each stage, but nothing that is not covered by the three items above. 

We follow Turing in claiming that his model is a good mathematical model 
for algorithmic computations. We are, however, not going to make this com- 
pendium into a study of Turing machines. The interested reader should consult 
other textbooks on the subject. 

The two key results are: 


Theorem 3.1.2 There is a fixed alphabet © such that for any alphabet d’, any 
Turing machine M over X! may be coded as a word [M] in o and every word w 
in X! may be coded as a word [w] in X such that the partial function 


U([M][w]) = [M(w)] 


is Turing computable, where we write M(w) for the output word if the input 
word is w. 


This is known as the existence of a Universal Turing Machine. 
The Halting Problem is the following: 


Given a Turing machine M and an input w, will M eventually come to a halt 
when started on w? 


Theorem 3.1.3 There is no Turing machine H that solves the Halting Problem 
in the following sense: 


A((M][w]) will always halt, and will halt with an empty output word if and only 
if M halts on w. 


Our approach to computability will be more in the original style of Gédel, we 
will study functions defined for natural numbers only. However, the results that 
we obtain will be relevant for the more general approaches as well. This is based 
on what is known as the Church-Turing Thesis, which we phrase like this: 


All algorithms can be simulated by a Turing Machine, 


and the fact that Turing-computability can be reduced to the notion we will 
be working with. This is left as one of the minor projects in Chapter 5, see 
Exercise 5.6. 
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3.2 Computable functions and c.e. sets 


3.2.1 The primitive recursive functions 
The basic definition 


Recursion means ‘backtracking’, and in pre-Church/Kleene mathematics the 
term recursive function was used for the functions defined by iterated recursion. 
In this pre-Church/Kleene context a recursive definition will be a definition of 
a function on the natural numbers, where we give one initial value f(0), and 
define f(k+1) as a function of f(k) and k. E.g. Skolem used the term ‘recursive 
function’ in this way. Following Kleene, we will call these functions primitive 
recursive. 

We let Z and ¥ etc. denote ordered sequences of natural numbers of some 
fixed length. Normally the length will not be specified, but will be clear from 
the context. 


Definition 3.2.1 The primitive recursive functions f : N” — N will be the 
least class of functions satisfying: 

i) f(z, ¥) =x +1 is primitive recursive. 
ii) Iim(t1,..-,2n) = x; is primitive recursive. 


) 
iii) f(Z) =q is primitive recursive for each g € N. 
) 


iv) Ifg is n-ary and primitive recursive, and f,..., fn, are m-ary and primitive 
recursive, then the composition 


is primitive recursive. 


v) If g and A are primitive recursive of arity n and n + 2 resp., then f is 
primitive recursive where 


f(0,9) = 9%) 
f@+1,9) = AF (@, 9), #9) 
We say that f is defined by primitive recursion or by the recursion operator 
from g and h. 
The pool of primitive recursive functions and sets 


Standard number theoretical functions like addition, multiplication, exponenti- 
ation and factorial will all be primitive recursive. For instance f(r,y) = x+y 
may be defined by 


-r7+0=2 


-x+(ytl)=(e4+y) 41. 
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Subtraction is not primitive recursive for the simple reason that it leads us 
outside N. We define a modified subtraction. This will be primitive recursive, 
see Exercise 3.1. 


Definition 3.2.2 We let — be the modified subtraction defined by 
z-y=an-yify<2 
z-y=Oifa<y. 
By the definition of the primitive recursive functions, all projection maps 
Lppl@ijes: stp): = 2; 


are primitive recursive. We can use this to consider any function of a set of 
variables as a function of a larger set of variables, as in 


f (£1, £2, 3) = g(11,3(#1, £2, £3), 13,3(@1, v2, £3)) = g(X1, @3). 


Thus we will not need to be concerned with the requirement that every function 
in a composition must be of the same arity. This will simplify some of the 
descriptions of primitive recursive functions. 


Definition 3.2.3 Let A C N”. The characteristic function of A will be the 
function 


Ka: N° AN 
that is 1 on A and 0 outside A. 


Definition 3.2.4 A set A C N” is primitive recursive if the characteristic func- 
tion K 4 is primitive recursive. 


The primitive recursive sets will form a Boolean algebra for every dimension, 
see Exercise 3.2. 

If A C N* is primitive recursive and f,g : N* — N are primitive recursive, 
we may define a primitive recursive function h by cases as follows: 


h(@) = Ka(@)- f(@) + -Ka(@))- 9(@)). 


We see that h(Z) = f(Z) when # € A and h(Z) = g(Z) otherwise. 

Every set defined from = and < using propositional calculus will be prim- 
itive recursive. We may also use functions known to be primitive recursive in 
the definition of primitive recursive sets, and primitive recursive sets when we 
describe primitive recursive functions. We will leave the verification of most of 
this to the reader, just state the properties of primitive recursion that are useful 
to us. The proofs are simple, and there is no harm leaving them as exercises. 
We will however prove one basic (and simple) lemma: 
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Lemma 3.2.5 Let f : N'+” = N be primitive recursive. Then the function g 
defined as the bounded product 


g(x, 9) = [J fe.) 
Zur 
will be primitive recursive. 


Proof 

We define g by primitive recursion as follows 
g(0, y) = f(0, y) 

gz +1,¥) = g(x,y) : fle+ LG) 

By the same argument we can show that the primitive recursive functions 
will be closed under bounded sums. What is more important to us is that 
the primitive recursive sets will be closed under bounded quantification. This 
is an important strengthening of the language we may use to define primitive 
recursive sets and functions. 


Lemma 3.2.6 Let A C N!*” be primitive recursive. Then the following sets 
are primitive recursive 


a) B= {(x,¥); dz < y((z,¥) € A)} 
b) C= {(x,9); Vz < y((z,9) € A)} 
The proof is left as Exercise 3.3. 


In computability theory, the j:-operator is important. Within primitive re- 
cursion theory we may often use bounded search, or a bounded p-operator: 


Lemma 3.2.7 Let f :N!+" =N be primitive recursive. Then 


is primitive recursive, where g(x,y) is the least z such that f(z,y) = 0 if there 
is one such z < x, while g(x, i) = x otherwise. 


g can be defined using primitive recursion and definition by cases. The details 
are left as Exercise 3.4. 

The interplay between the primitive recursive functions and the primitive 
recursive sets is ruled by the following principles: 


Theorem 3.2.8 a) Every set definable from = and < using primitive recur- 
sive functions, boolean operators and bounded quantifiers will be primitive 
recursive. 


b) Every function defined by the schemes of primitive recursion, bounded 
search over a primitive recursive set and definition by cases over a finite 
partition of N” into primitive recursive sets will be primitive recursive. 


There is no need to prove this theorem, since it in a sense is the synthesis of 
what has been proved so far. The consequence is the level of freedom in defining 
primitive recursive functions and relations we have obtained. This freedom will 
be sufficient when we later claim that certain facts are trivial to prove. 


60 


Sequence numbers 


One important use of primitive recursion is the coding of finite sequences. Gédel 
needed an elaborate way of coding finite sequences via the so called 3-function. 
As we mentioned above, it was important for Gédel to show that the recursive 
functions are definable in ordinary number theory. Since this is not important 
to us to the same extent, we will use full primitive recursion in coding such 
sequences. All proofs of the lemmas below are trivial and can safely be left for 
the reader. 


Lemma 3.2.9 a) The set of prime numbers (starting with 2 as the least 
prime number) is primitive recursive. 


b) The monotone enumeration {pisien of the prime numbers is primitive 
recursive (with po = 2). 


We now define the sequence numbers: 


Definition 3.2.10 Let x9,...,2%n—1 be a finite sequence of numbers, n = 0 
corresponding to the empty sequence. 
We let the corresponding sequence number (x0,...,%n—1) be the number 
n-1 
2”. Il Pit 
i=0 


If y = (@o,..-,%n—1), we let lh(y) (the length of y) be n and (y); = 2%. 
If « = (%,.--,%n-1) and y = (Yo,---,Ym—1), we let x * y be the sequence 
number of the concatenation, i.e. 


LR Y= (09; 2) Lai; YOss s8y Ym—1) 


Sometimes, when it is clear from the context, we will misuse this terminology 
and let 


a * (%,.-.-,%n—-1) = (@,20,---,;£n—1)- 


Lemma 3.2.11 a) The set of sequence numbers is primitive recursive, and 
the sequence numbering is one-to-one (but not surjective). 


b) The function lh is the restriction of a primitive recursive function to the 
set of sequence numbers. 


c) The function coor(y,i) = (y)i ( the i’th coordinate of y) defined for se- 
quence numbers y of length > i is the restriction of a primitive recursive 
function of two variables. 


d) For each n, the function 
$€Gn(Z0,---,)Ln—-1) = (£0,---,Fn-1) 


is primitive recursive. 
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e) For all sequences %o,...,%n-1 andi<n, 4; < (X0,---,;%n—1) 
f) 1 is the sequence number of the empty sequence. 
h) Concatenation of sequence numbers is primitive recursive. 


It makes no sense to say that the full sequence numbering is primitive re- 
cursive. However any numbering satisfying Lemma 3.2.11 can be used for the 
purposes of this course, so there is no need to learn the details of this defini- 
tion. Occasionally it may be useful to assume that the sequence numbering is 
surjective. This can be achieved, see Exercise 3.5. 


Primitive recursion is technically defined on successor numbers via the value 
on the predecessor. Sometimes it is useful to use the value on all numbers less 
than the argument x in order to define the value on x. In order to see that this 
is harmless, we use the following construction and the result to be proved in 
Exercise 3.6: 


Definition 3.2.12 Let f:N—N. 
Let f(n) = (f(0),...,f(n—1)). 
We will sometimes use an alternative coding of pairs that is both 1-1 and 
onto: 
Definition 3.2.13 Let 
1 
P(x,y) = 5 (e+)? + 3x +y) 


It can be shown that P : N? — N is a bijection. Let 7, and m2 be the two 
projection maps such that for any x 


P(m (x), 72(x)) = x. 


P, 7 and 7 are primitive recursive. The verifications are left for the reader as 
Exercise 3.5 e). 


Ackermann proved that there is a total computable function that is not 
primitive recursive. His observation was that in the list of functions 


fo(x,y) =“ 1, fi(z,y) =2+Y, fo(a,y) = wy, f3(x,y) = x4 


each function but the first is defined as a y-iteration of the previous one. 
For n > 3 we may then define 


fn(x,0) =1, fr(w,y +1) = fr—1(fn(2, y), 2) 


This defines the generalized exponentiations, or the Ackermann-branches. The 
three-place function 

f(n, x,y) = fn(x,y) 
is not primitive recursive. 
We will not prove Ackermann’s result as stated, but in Exercise 3.7 we will see 
how we may define a function using a double recursion that is not primitive 
recursive. It will be easier to solve this exercise after the introduction to Kleene 
indexing. 
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3.2.2. The computable functions 
The p-operator 


We extend the definition of the primitive recursive functions to a definition of 
the computable functions by adding one principle of infinite search. We will 
consider the construction 


g(@) = px. f(x,Z) =0. 


In order to understand this definition, we must discuss what we actually mean, 
i.e. which algorithm this is supposed to represent. 

Intuitively we want pa.g(z,£) = 0 to be the least x such that g(x,7) is 0. 
If g(a,Z) € N for all x, this is unproblematic, we search for this least x by 
computing g(0,Z) , g(1, 2%) and so on until we find one value of x giving 0 as the 
outcome. Now, if there is no such x we will search in vain, or in more technical 
terms, our procedure will be non-terminating. This forces us to introduce partial 
functions, i.e. functions being undefined on certain arguments. This further 
forces us to be careful about our interpretation of the pz-operator, we may in the 
search for the least x such that g(x, Z) = 0 face a z for which g(z, Z) is undefined 
before we find a z for which g(z,Z) = 0. In this case we will let ux.g(x,Z) =0 
be undefined, realizing that the algorithm for computing this number that we 
have in mind will be non-terminating. 

With this clarification we ad the following 


Definition 3.2.14 The computable functions is the least class of partial func- 
tions f : N” — N satisfying i) - v) in the definition of the primitive recursive 
functions, together with the extra clause 


vi) If g: N°+! — N is computable then 
f(€) = px.g(x,Z) =0 
is computable. 


A set is computable if the characteristic function is computable. 
A total computable function is a computable function terminating on all inputs 
from the domain N”. 


Remark 3.2.15 Gédel’s p-recursive functions will be the total computable 
functions 


We must have in mind that the characteristic function of a set is total, so 
dealing with computable sets and with total computable functions will be much 
of the same. This is made precise in the following lemma: 


Lemma 3.2.16 Let f: N” — N be total. Then the following are equivalent: 


i) f is computable. 
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ii) The graph of f, seen as a subset of N*+!, is computable. 


Proof 
Let A be the graph of f, AK the characteristic function of A. 
If f is computable, then 


Ka(@, y) _ K-(f(Z),y) 


and K is primitive recursive, see Exercise 3.1, g) and e). 
If kK, is computable, then 


f(@) = by.1=Ka(a,y) = 0, 
so f is computable. 


Remark 3.2.17 This result will not hold for primitive recursion, there will be 
functions with primitive recursive graphs that are not primitive recursive. 


There is an interplay between the total computable functions and the com- 
putable sets resembling Theorem 3.2.8 as follows 


Theorem 3.2.18 a) Any set definable from computable sets and total com- 
putable functions using boolean valued operators and bounded quantifiers 
will be computable. 


b) If AC N"*+! is computable, then g is computable, where 
g(Z) is the least number z such that (z,#) € A. 


The proof is trivial. 


Kleene’s T-predicate 


The definition of the computable functions is technically an inductive defini- 
tion of a class of partial functions. It is, however, important to be aware of 
the computational interpretation of this definition, the so to say ‘operational 
semantics’. Every partial computable function is given by a term in a language 
with constants for each number, the +1 function and symbols for the recursion 
operator and the p-operator. 

This operational semantics then tells us that there are actual computations go- 
ing on. Now we will define the concept of a computation tree. A computation 
tree will be a number coding every step in a computation up to the final out- 
come. In order to be able to do so, we need a Gédel numbering, or an indexing, 
of the computable functions. Observe that the numbering will not be 1-1, we 
will enumerate the algorithms or the terms, and then only indirectly enumerate 
the computable functions. 


Definition 3.2.19 For each number e and simultaneously for all n we define 
the partial function $? of n variables % = (a1,...,%n) as follows: 


64 


= (1), let @2(%) =a, +1. 
= (2,1) and 1 <i<n, let 62(Z) = 4%. 
3,4), let 62(@) = 4. 
,e',dy,...,dm) let 


v) Ife = (5,d,,d2) then 

ee (0, ) = $3, (#) 

e(e+1,2) = ag ia l(a, #), 2,2). 
vi) If e = (6,d) then 


$2 (E) = n2.07""(z, #) =0 
Otherwise If neither of i) - vi) above applies, let ”(Z) be undefined. 


Remark 3.2.20 We have defined ¢.(#) for every index e and every input Z, 
either as an undefined value or as a natural number. Indeed, if we get a natural 
number, this number will be unique, see Exercise 3.8. 

When no misunderstanding should occur, we will drop the superscript n and 
write ¢-(Z) instead of $2 (2). 


Definition 3.2.21 We write ¢.(Z)| if there is a y with ¢.(@) = y. We then 
say that ¢-(Z) terminates. 


We are now ready to use the sequence numbering and this indexing to define 
computation trees. Each terminating computation will have a unique computa- 
tion tree, a number coding each step of the computation from the input to the 
output. We will actually be overloading this code with information, but for our 
purposes this is harmless. What is important to us is that information retrieval 
will be easy. 


Definition 3.2.22 Let ¢.(Z) = y. By primitive recursion on e we define the 
computation tree of $-(Z) = y as follows, assuming that the index e will be the 
index of the corresponding case: 


i) (e,£,21 +1) is the computation tree for ¢.(x, 7) =#+1. 


ii) (e,#,2;) is the computation tree for @.(#) = a;. 


)< 
oS 
) (e,# 
) fe, 


iii) (e,Z,q) is the computation tree for ¢.(Z) = q. 

iv) (e,t,t1,...,tn,y) is the computation tree in this case, where each t; is 
the sorplitation tree for ¢4,(@) = z; and t is the computation tree for 
be (Z) = y. 
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v) (e,0,t,y) is the computation tree for ¢.(0,£) = y when t is the computa- 
tion tree for $a, (Z) = y. 
(e,a + 1,t1, t2,y) is the computation tree for ¢-(# + 1,#) = y when ty is 
the computation tree for ¢.(x,Z) = z and tz is the computation tree for 
dy (Z, 2,4) = y. 

vi) (e,to,..-,ty-1,ty,y) is the computation tree in this case, where ¢; is the 
computation tree for @g(i, Z) = z A 0 for i < y and t, is the computation 
tree for daly, Z) = 0. 


We say that ¢ is a computation tree for ¢”(£) if for some y, t is the computation 
tree for 6? (#) = y. 


We are now ready to define Kleene’s T-predicate: 


Definition 3.2.23 Let 
TC, Ci yess; Enyt) 


if t is a computation tree for ¢.(@1,..., Un) 
We will normally write T instead of T;. 
Theorem 3.2.24 a) For each n, Tn is primitive recursive. 


b) There is a primitive recursive function U such that if t is a computation 
tree, then U(t) is the output of the corresponding computation. 


c) (Kleene’s Normal Form Theorem) 
For every arity n and all e we have 


be(L1,---,%n) = U(ut.T,(e, 21,...,%n,t)). 


Proof 

It is only a) that requires a proof. The proof of a) is however easy, we construct 
the characteristic function of T,, by recursion on the last variable t. Monotonisity 
of the sequence numbering is important here. We leave the tedious, but simple 
details for the reader. 


Corollary 3.2.25 For each number n, the function 
f(e,21,...,%n) = be(11,...,2n) 
is computable. 


Remark 3.2.26 Corollary 3.2.25 is the analogue of the existence of a universal 
Turing machine, we can enumerate the computable functions in such a way 
that each computable function is uniformly computable in any of the numbers 
enumerating it. This universal function is partial. There is no universal function 
for the total computable functions, see Exercise 3.9. 
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The Recursion Theorem 


The recursion theorem is one of the key insights in computability theory in- 
troduced by Kleene. In programming terms it says that we may define a set 
of procedures where we in the definition of each procedure refer to the other 
procedures in a circular way. The proof we give for the recursion theorem will 
be a kind of ‘white rabbit out of the hat’ argument based on the much more 
intuitive S”-theorem. So let us first explain the 57'-theorem. Let f be a com- 
putable function of several variables. Now, if we fix the value of some of the 
variables, we will get a computable function in the rest of the variables. The 
S” -theorem tells us that the index for this new function can be obtained in a 
primitive recursive way from the index of the original function and the values 
of the fixed variables. We have to prove one technical lemma: 


Lemma 3.2.27 There is a primitive recursive function p (depending on n) such 
that if 


$2 (%1,---;%n) =t 


and 
Ll<i<n 
then 
-1 
Hy(evixarg) (ELD see Vj-1,Vj41,--- ,In) =t. 
Proof 


We define p by induction on e, considering the cases i) - vi). We leave some of 
the easy details for the reader, see Exercise 3.11. 

oe(f) =a, +1: 

If 1 <i let p(e,i,x;) =e, while p(e, 1,21) = (3,4, + 1). 

The cases ii) and iii) are left for the reader. 

If e = (4, e’,d1,...,dn), we simply let 


p(e, i, x4) = (4, e’, o(d1, i, zi), oe »P(dn,t, Xi)). 


Case v) splits into two subcases. If 1 < i, this case is easy. For i = 1 we let 
p(e,1,0) = dy. 

p(e,1,a +1) = (4, (3,2), p(e, 1,2), dz, (2, 1),...,(2,n)) 

where (3,z) is the index for f(#) = x and (2,7) is the index for the function 
selecting x; from #. In this case the primitive recursion is replaced by an iterated 
composition, the depth of which is determined by the value of x. 

Case vi) is again easy, and is left for the reader. 


Theorem 3.2.28 (The 57,-theorem) 
Letn >1,m>1. There is a primitive recursive function S?, such that for all 


€,%1,-++5Un,Yms--++5Ym 


Per (ay, see5 Uns Y1,--- Ym) = DS (ey y...50n) YL Sees Ui) 
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Proof 
Let p be as in Lemma 3.2.27. 
Let Si (e,x) = p(e,1,2) 
Let Skt (e,a1,...,e41) = p($%41(e,21,---, 2x), 1,e41)- By an easy induc- 
tion on k we see that this construction works for all m. 

The S*-theorem is a handy tool in itself, and we will use it frequently 
stating that we can find an index for a computable function uniformly in some 


parameter. Now we will use the $7-theorem to prove the surprisingly strong 


Theorem 3.2.29 (The Recursion Theorem) 
Let f(e,£) be a partial, computable function. 
Then there is an index e€, such that for all # 


deo (£) & f(eo, £). 


Proof 
Recall that by this equality we mean that either both sides are undefined or 
both sides are defined and equal. Let 


gle, Z) = f(Sz(e, e), Z) 
and let g be an index for g. Let 
eo = $n(9, 9): 
Then 
Geo(#) = 81 (4,a)(#) = b4(G,#) = 9(G,%) = f(5)(9, 9), #) = feo, 2). 

Remark 3.2.30 Readers familiar with the fixed point construction in untyped 
A-calculus may recognize this proof as a close relative, and indeed it is essentially 
the same proof. The recursion theorem is a very powerful tool for constructing 
computable functions by self reference. In Chapter 4 we will use the recursion 
theorem to construct computable functions essentially by transfinite induction. 
Here we will give a completely different application, we will prove that there is 


no nontrivial set of partial computable functions such that the set of indices for 
functions in the class is computable. 


Theorem 3.2.31 (Riece) 
Let ACN be a computable set such that ife € A and ¢ = da thende€ A. 
Then A=N or A=9. 


Proof 

Assume not, and leta€ Aandb¢ A. 

Let f(e,x) = d(x) ife € A and f(e,x) = da(a) ife ¢ A. 

By the recursion theorem, let eg be such that for all x 
f(€0,®) = Geq(x). 

If eg € A, then de, = dp 80 €p € A. 

If eg ¢ A, then de, = da SO €0 € A. 

This is a clear contradiction, and the theorem is proved. 
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Corollary 3.2.32 (Unsolvability of the halting problem) 
{(e, x) ; be(x) |} is not computable. 


Remark 3.2.33 Riece’s theorem is of course stronger than the unsolvability of 
the Halting Problem, for which we need much less machinery. 


3.2.3. Computably enumerable sets 
Four equivalent definitions 


An enumeration of a set X is an onto map fF: N — X. For subsets of N we 
may ask for computable enumerations of a set. A set permitting a computable 
enumeration will be called computably enumerable or just c.e. The standard 
terminology over many years has been recursively enumerable or just r.e., be- 
cause the expression recursive was used by Kleene and many with him. We will 
stick to the word computable and thus to the term computably enumerable. 
Of course there is no enumeration of the empty set, but nevertheless we will 
include the empty set as one of the c.e. sets. 
In this section we will give some characterizations of the c.e. sets. One impor- 
tant characterization will be as the semi-decidable sets. A computable set will 
be decidable, we have an algorithm for deciding when an element is in the set 
or not. In a semi-decidable set we will have an algorithm that verifies that an 
element is in the set when it is, but when the element is not in the set, this 
algorithm may never terminate. A typical semi-decidable set is the solving set 
of the Halting Problem 

{(e,2) 5 e(x)l}. 


Another example is the set of theorems in first order number theory or any 
nicely axiomatizable theory. The set of words in some general grammar will 
form a third class of examples. 

We will show that the semi-decidable subsets of N will be exactly the c.e. 
sets. A third characterization will be that the c.e. sets are exactly the sets of 
projections of primitive recursive sets. In the literature this class is known as the 
i9-sets. A fourth characterization will be as the ranges of partial, computable 
functions. 

This is enough talk, let us move to the definition: 


Definition 3.2.34 Let A C N. We call A computably enumerable or just c.e. 
if A= 9 or A is the range of a total computable function. 


Theorem 3.2.35 Let ACN. Then the following are equivalent: 
i) A is ce. 
ti) A is the range of a partial computable function. 
iti) There is a primitive recursive set S C N? such that 


A={n; AIm(n,m) € S} 
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iv) There is a partial computable function with domain A. 


Proof 

Since the empty set satisfies all four properties, we will assume that A # @). 

i) => ii): 

Trivial since we in this compendium consider the total functions as a subclass 
of the partial functions. 

ii) = iii): 

Let A be the range of ¢¢. 

Then 


neé As sAy(T(e, m(y), m2(y)) An = U(m2(y))) 


where 7, and 7g are the inverses of the pairing function P, T is Kleene’s T- 
predicate and U is the function selecting the value from a computation tree. 
The matrix of this expression is primitive recursive. 

iii) > iv): Let 


n€ AS dAm((n,m) € S). 


where S is primitive recursive. Then A is the domain of the partial computable 
function 

f(n) = pum.(n,m) € S. 
iv) > i): 
Let A be the domain of ¢, and let a € A (here we will use the assumption that 
A is non-empty). 
Let f(y) = mi(y) if T(e,m(y), m2(y)), f(y) = a otherwise. Then f will be 
computable, and A will be the range of f. 
This ends the proof of the theorem. 


Clearly characterizations ii) and iii) makes sense for subsets of N” as well 
for n > 1, and the equivalence will still hold. From now on we will talk about 
c.e. sets of any dimension. The relationship between c.e. subsets of N and N” 
is given in Exercise 3.14. 

The following lemma will rule our abilities to construct c.e. sets: 


Lemma 3.2.36 Let A CN” and B CN” be c.e. Then 
a) ANB and AUB are both c.e. 
b) Ifn=m-+k and both m and k are positive, then 
{@ ; Ay(#,y) € A} 


will be c.e., where # is a sequence of variables of length m and ¥ is of 
length k. 


Moreover, every computable set is c.e., the inverse image or direct image of a 
c.e. set using a partial computable function will be c.e. 
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All these claims follow trivially from the definition or from one of the char- 
acterizations in Theorem 3.2.35. 


Let us introduce another standard piece of notation: 


Definition 3.2.37 Let 


W.={n; ée(n)|} ={n; 3tT(e,n,t)}. 


Let 
Wem ={n; dt < mT (e,n,t)}. 


We let K be the diagonal set 


K={e;eew,}. 
Lemma 3.2.38 a) {(e,n); n € We} is ce. 
b) K is ce. 
c) Each set Wem, is finite. 
d) {(e,n,m); 2 € Wem} is primitive recursive. 


All proofs are trivial. 


Selection with consequences 


From now on we will prove lemmas and theorems in the lowest relevant dimen- 
sion, but clearly all results will hold in higher dimensions as well. 


Theorem 3.2.39 (The Selection Theorem) 
Let A CN? be c.e. Then there is a partial computable function f such that 


i) f(n)\|S dm(n,m) € A. 
ii) If f(n)l then (n, f(n)) € A. 


Proof 

This time we will give an intuitive proof. Let A be the projection of the primitive 
recursive set B C N® (characterization iii).). For each n, search for the least m 
such that (n,71(m),72(m)) € B, and then let f(n) = 7(m). 


Intuitively we perform a parallel search for a witness to the fact that (n,m) € 
A for some m, and we choose the m with the least witness. 


Corollary 3.2.40 Let A and B be two c.e. sets. Then there are disjoint c.e. 
sets C and D with 


CCA,DCBandAUB=CUD. 


71 


Proof 

Let E = (A x {0}) U(B x {1}) and let f be a selection function for E. 
Let C={n; f(n) =0} and D= {n; f(n) =1}. 

Then C and D will satisfy the properties of the corollary. 


Corollary 3.2.41 A set A is computable if and only A and the complement of 
A are ¢.e. 


One way is trivial, since the complement of a computable set is computable and 
all computable sets are c.e. So assume that A and its complement B are c.e. 
Let E be as in the proof of the corollary above, and f the selection function. 
Then f is the characteristic function of A, so A is computable. 


Corollary 3.2.42 Let f: N—N be a partial function. Then the following are 
equivalent: 


i) f is computable. 
ti) The graph of f is c.e. 


Proof 

If the graph of f is c.e., then f will be the selection function of its own graph, 
which is computable by the selection theorem. If on the other hand f is com- 
putable, then the graph of f will be the domain of the following function g(n, m): 
Compute f(n) and see if the result equals m. 


Computably inseparable c.e. sets 


In Exercise 3.16 we will see that two disjoint complements of c.e. sets can be 
separated by a computable set. This is an improvement of Corollary 3.2.42. 
Here we will show that a similar separation property does not hold for c.e. sets, 
and we will draw some consequences of this fact. 


Definition 3.2.43 Let A and B be two disjoint subsets of N. We say that A 
and B are computably separable if there is a computable set C' such that A C C 
and BNC =. Otherwise A and B are computably inseparable. 


Theorem 3.2.44 There is a pair of computably inseparable c.e. sets. 


Proof 

Let A= {e; ¢-(e) =0} and B= {e; ¢,(e) = 1}. 

Assume that C is a computable set that separates A and B, and assume that 
€g is an index for the characteristic function of C. 

Then, if e9 € C we have that ¢.,(e9) = 1. Then eg € B which is disjoint from 
Cc. 

Likewise, if eg ¢ C we see that eg € A C C. In both cases we obtain a 
contradiction, so the existence of C’ is impossible. 


Now this theorem has some interesting consequences concerning the differ- 
ence between classical and constructive mathematics. We will end our general 
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introduction to the basics of computability theory discussing some of the con- 
sequences. 


Definition 3.2.45 a) A binary tree is a non-empty set D of finite 0-1- 
sequences such that any initial segment of an element in D is also in D. A 
binary tree is computable if the set of sequence numbers of the sequences 
in D is computable. 


b) An infinite branch in a binary tree D is a function f : N — {0,1} such 
that (f(0),...,f(m—1)) € D for all n. 


Lemma 3.2.46 (K6nig’s Lemma) 
An infinite binary tree has an infinite branch. 


The proof of K6nig’s lemma is trivial, but has very little to do with com- 
putability theory. One constructs a branch by always extending the sequence 
in a direction where the tree is still infinite. 

Well known theorems proved by similar arguments will be that a continuous 
function on a closed bounded interval will obtain its maximum and that any 
consistent first order theory has a complete extension. 


Remark 3.2.47 The version of K6nig’s Lemma given above, restricting our- 
selves to binary trees, is often called Weak Kénig’s lemma. The full Konig’s 
lemma then refers to infinite trees with finite branching, not just binary branch- 
ing. There are results showing that Weak Konig’s lemma is equivalent, relative 
to some very weak formal theory, to the mentioned theorems from analysis and 
topology. 


We will show a failure of a constructive version of Kénig’s lemma: 


Lemma 3.2.48 There is an infinite, computable binary tree without a com- 
putable, infinite branch. 


Proof 
Let A and B be two computably inseparable c.e. sets and let {An}nen and 
{Bn}nen be primitive recursive sequences of sets A, and B, contained in 
{0,...,2 — 1}, with A= Ucn An and B= Unen Bn- 
Ifo is a 0-1-sequence of length n, we let o € Dif for alli< n:i€ A, > o(t) =1 
and i € B, => o(i) = 0. o will be a characteristic function on {0,...,n — 1} 
separating A, and By. 
Now the characteristic function of any set separating A and B will be an infi- 
nite branch in D. Since A and B are disjoint, D will be an infinite, binary tree. 
Moreover, D will be primitive recursive. A computable, infinite branch will on 
the other hand be the characteristic function of a computable set separating A 
and B, something assumed not to exist. This ends the proof. 

In Exercises 3.17 and 3.18 we will give an application of this lemma to 
propositional calculus and to constructive analysis. 
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3.2.4 Crosses and naughts 


Once upon the time there was a popular game known as “Crosses and naughts”. 
This game requires two players and a potentially large piece of paper split into 
squares. Player no. 1 writes a cross in one of the squares, player no. 2 writes 
a naught in one of the empty squares, player no. 1 writes a cross in one of 
the empty squares and so on. The first player who fills a row of five symbols, 
up-down, sideways or along a diagonal has won the game. 

In order to turn this into a mathematical problem, we make two assumptions: 


e The sheet of paper is infinite in all directions. 


e Each player must write a symbol within a prefixed distance of the game 
played so far at each move, e.g. in a square sharing at least one point with 
a square alreaddy filled. 


IWe will see that it is impossibe for player 2 to have a winning strategy. If player 
2 had a winning strategy, player 1 could pretend that she/he was player 2 and 
base her/his game on that strategy and on an imagined first move from the real 
player 2. If player 2 later makes this imagined move, player 1 just base her/his 
further game on a new imagined move. This will enable player 1 to winn. 

This leaves us with two possibilities: 


1. Player 1 will have a winning strategy. 
2. Player 2 has a strategy for dragging the game into infinite time. 
It follows by weak KG6nig’s lemma that one of the two will hold. 


Conjecture 3.2.49 Player 2 has a strategy for a never-ending game, but there 
is no computable strategy to this effect. 


It is a consequence of K6nig’s lemma that if Player 1 has a winning strategy, 
there is an upper limit to how man moves Player 1 will need in order to win 
the game. Then we may even abandon the assumption that the sheet of paper 
is infinite. To the author, this seems rather unlikely. This supports one part of 
the conjecture. The other part is not really supported by a belief, but by the 
appealing possibility that you might beat any computer in this game, if you get 
hold of how it is programmed, but you may not be able to beat an improvising 
human being. 


3.3. Degrees of Unsolvability 


3.3.1 m-reducibility 
Discussion 


Having introduced the main concepts of computability theory there are several 
options. One option will be to give a further analysis of the computable functions 
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and subclasses of them. This will lead us to complexity theory or to subrecursive 
hierarchies (see Section 3.6). One key concept then will be that one set or 
function can be reduced to another in some simple way. 

We will not consider such a fragmentation of the computable functions into 
interesting subclasses yet. Instead we will ask for reductions between possible 
non-computable functions and sets, using the complexity of computability itself 
for defining the notion of reduction. The connection between this approach and 
the ones mentioned above is that both the formation of interesting concepts and 
the methodology of those approaches to computability theory are based on the 
experience gained from investigating computable reductions in general. Thus, 
in an introductory course, where one should learn the computability theorists 
way of thinking, this section is basic. 


Definition 3.3.1 Let A and B be two subsets of N. 
We say that A is m-reducible to B, A <m B, if there is a total computable 
function f such that for all n: 


neEeAsf(n)EB. 


We read this as ‘many-one-reducible’, since f may be a many-one function. If 
we insist on f being injective, we will get 1-reducibility. 


There is no way we can reduce N to @ and vice versa, and we will exclude 
those trivial sets from our further discussion. We then get the observations: 


Lemma 3.3.2 a) If A is computable and B#N,9, then A < m B. 
b) A<m Bs (N\A) <m (N\ 3B). 
C) <m is transitive. 

The proofs are easy and are left for the reader. 


Definition 3.3.3 We call two sets A and B m-equivalent if A <m B and 
B<, A. We then write A =,, B. 


Clearly =, will be an equivalence relation with a partial ordering inherited 
from <,,. We call the set of equivalence classes with this induced ordering the 
m-degrees. 


Remark 3.3.4 Let us briefly discuss the distinction between a complexity class 
and a degree. A complexity class will consist of all functions and/or sets that 
may be computed or decided with the aid of a prefixed amount of computing 
resources. Resources may be the time or space available (normally as functions 
of the input), but to us it may also be the possible service of a non-computable 
oracle. This will be better explained later. A degree, on the other hand, will 
be some mathematical object measuring the complexity of a function or a set. 
It has turned out that the most useful object for this purpose simply is the 
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class of functions of the same complexity as the given object. It is like the old 
Frege semantics where 17 is interpreted as the set of all sets with 17 elements, 
and in general a property is interpreted as the set of all objects sharing that 
property. It remains to decide what we mean by ‘having the same complexity’, 
and we will consider two (out of many in the literature) possible choices. Being 
m-equivalent is one, and later we will learn about Turing-equivalent functions. 


The m-degrees have some properties easy to establish: 


Lemma 3.3.5 a) Let X be a finite set of m-degrees. Then X has a least 
upper bound. 


b) Let X be a countable set of m-degrees. Then X has an upper bound. 


Proof 

The computable sets will form the minimal m-degree. Thus the empty set of 
m-degrees has a least upper bound. In order to prove the rest of a) it is sufficient 
to show that {A,B} will have a least upper bound. Let 


A®B= {2n; n€E€ ASU{2n+1; neE B}. 


This will define the least upper bound, see Problem 3.19. 
In order to prove b), Let {A,, ; n € N} be a countable family of sets. Let 


A= UnenAn = {(n,m) ; m€ Ap}. 


Then the m-degree of A will bound the m-degrees of A, for each n € N. 


We will invest most of our efforts in analyzing the Turing degrees below. We 
will however prove one negative result, the m-degrees do not represent a linear 
stratification of all nontrivial sets into complexity classes. 


Lemma 3.3.6 There are two nontrivial sets A and B such that A £m B and 
BkmA. 


Proof 

Let A be c.e. but not computable. Let B=N\ A. 

If C <, A, C will be c.e., so B is not m-reducible to A. It follows from Lemma 
3.3.2 that A is not m-reducible to B. 


An m-complete c.e. set 


We proved that the computable inverse of a c.e. set is c.e. A reformulation of 
this will be 


Lemma 3.3.7 Let A <,, B. If B is c.e., then A is c.e. 


Thus the c.e. sets form an initial segment of all sets pre-ordered by m-reductions. 
First we will show that this set has a maximal element: 
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Lemma 3.3.8 Let K = {e; ¢-(e)|}. Then any other c.e. set is m-reducible to 
K. 


Proof 

Let A = {d ; ¢-(d) |}. Adding a dummy variable we can without loss of 
generality assume that A = {d ; ¢-(d,x)|} where the outcome of ¢.(d,2) is 
independent of x. Then A <,, K by 


A={d; Si(e,d) € K}. 


A natural question is now if there are c.e. sets that are not m-equivalent 
to K or the computable sets; are there more m-degrees among the c.e. sets? 
This was answered by Emil Post, when he classified a whole class of c.e. sets 
‘in between’. 


Simple sets 


Definition 3.3.9 A c.e. set A is simple if the complement of A is infinite, but 
does not contain any infinite c.e. sets. 


A simple set cannot be computable (why?). We will prove that there exist 
simple sets, and that K cannot be reduced to any simple sets. 


Lemma 3.3.10 There exists a simple set. 


Proof 

Let B= {(e,x); 2e< tA ¢,(x)I}. 

Let g be a computable selection function for B, i.e. g(e)| when (e,x7) € B for 
some x, and then g(e) selects one such «. 

Let A be the image of g. Then A is c.e. 

Since g(e) > 2e when defined, the complement of A will be infinite. This is 
because each set {0,...,2e} will contain at most e elements from A. 

If W. is infinite, W. will contain a number > 2e, and g(e) will be defined. Then 
ANW. 4%, so Wz is not contained in the complement of A. This shows that 
A is simple. 


Lemma 3.3.11 Let K <,, A where A is c.e. 
Then the complement of A contains an infinite c.e. set. 


Proof 
Let f be total and computable such that 


eE€KRSflel)ed. 


We will construct a computable sequence {x;};en of distinct numbers outside 
A, and use the recursion theorem to glue the whole construction together. 
The induction start will be the empty sequence. 
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Assume that B,, = {20,..-,%n—1} has been constructed such that B,, is disjoint 
from A. 

Let p(n) be such that Wn) = {e ; fle) € Bn}. By the uniformity of the 
construction, p will be computable. We will let x, = f(p(n)). 

If tp € By we have that x, ¢ A, so p(n) ¢ K, and p(n) ¢ Wyn). This 
contradicts the assumption that 2, € B, and the definition of W,(n). Thus 
tn € By. 

On the other hand, if x, € A, then p(n) € K, p(n) € W,,,) and by construction, 
tn € By, which we agreed was impossible. Thus x, is a new element outside A. 
We can then continue the process and in the end produce an infinite c.e. set 
outside A. 


Corollary 3.3.12 There is a non-computable c.e. set A such that K is not 
m-reducible to A. 


3.3.2 Turing degrees 
Relativised computations 


In the previous section we considered a notion of reducibility between sets based 
on the idea ‘A is simpler than B if we can gain information about A by making 
one computable call about membership in B’. Now, if we imagine that we 
are given the superhuman ability to decide membership in B, we must also 
permit ourselves to ask more intricate questions about B in order to decide 
membership in A, and still claim that we use our superhuman power to make 
A decidable. There are several notions of reductions based on the idea that 
we may be allowed any computable set of questions about membership in B 
and draw the conclusions from the answers. We will not consider any of those 
intermediate notions of reduction in this introductory course, but take the most 
drastic approach: We extend the definition of the computable functions by 
adding some extra functions as initial ones. This is technically described in the 
following definition: 


Definition 3.3.13 Let fi,..., f, bea finite list of partial functions f; : N —- N. 
a) We extend the definition of $,(Z) to a definition of 
Gn 


by adding a new clause 
vii) If e = (7, i,k) then oft---S(x, Z) = fi(x). If f;(x) is undefined, then 
this computation does not terminate. 


b) The computation tree will in this case be (7,1, k, x, f;(x)). 


We call these algorithms relativized algorithms meaning that the concept of 
computation has been relativized to fi,...,f,. One important aspect of this 
definition is the finite use principle. Even if we in reality accept functions as 
inputs in algorithms as well, no terminating algorithm will use more than a 
finite amount of information from the functions involved. 
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Lemma 3.3.14 Assume that oft--f*(@) = z. Then there are finite partial 
subfunctions fi,...,f, of fi,-..,fe resp. such that 

pit (Z) =z 
with the same computation tree. 


The proof is by a tedious but simple induction on e, using the fact that a 
sequence number is larger than all parts of the sequence. 

We have given and used an enumeration of all ordered sequences of natural 
numbers. In the same fashion we can construct an enumeration {& }icen of 
all finite partial functions such that the following relations will be primitive 
recursive: 


- &(x) 
- &(z)=y 
- dom(&) C {0,...,n}. 


The reader is free to ad any other important properties, as long as they are 
correct. 


Example 3.3.15 In order to avoid that the reader will focus on inessential 
details, we offer the construction of €; in the guise of an example. 

Let po, p1,p2,--. be an enumeration of the prime numbers, starting with po = 2, 
pi = 3 and so on. 

Define €; by 


&(n) = m if there are exactly m+ 1 factors of p, in i. 
&] if pn is not a factor in i. 


For example, if i = 23-74-11, then €;(0) = 2, (3) = 3 and €;(4) = 0. For all 
other inputs x, €;(a) will be undefined. 


The properties above are sufficient to prove the extended Kleene T-predicate: 


Lemma 3.3.16 For each n and k the following relation is primitive recursive: 
Tr ble, £,t1,...,t%,t) = t is the computation tree of aes, (Z). 


In order to save notation we will from now on mainly consider computations 
relativized to one function. Using the coding 


(fos. +++ fe—1)(@) = fry(x)(mod k)(M2(@)) 
we see that there is no harm in doing this. 


Lemma 3.3.17 There is a primitive recursive function c such that for any 
partial functions f, g and h, if for all x, f(a) = $2(ax) and for all x, g(x) = 
g(x), then for all x, f(x) = Pred) (x). 
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Proof 

c(e, d) is defined by primitive recursion on e, dividing the construction into cases 
i) - xi). 

For cases i) - iii), we let c(e,d) =e. 

For the cases iv) - viii) we just let c(e,d) commute with the construction of 
e from its subindices, i.e., if the case is e = expr(ei,...,€n) then c(e,d) = 
expr(c(e1,d),...,c(€n,d)), where expr can be any relevant expression. 

If e = (9,1,1) we have 49(2,Z) = g(x) = o/(x), so we let c(e,d) = d’ where 
6", (x, £) = oh (a). d’ is primitive recursive in d. 

Remark 3.3.18 There is an alternative proof of this lemma. By the finite use 
principle we see that if g is computable in h and f is computable in g, then the 


graph of f will be c.e. relative to h. Everything will be uniform, so we may 
extract c from this proof as well. 


We can use the concepts introduced here to talk about computable function- 
als, and not just about computable functions. 


Definition 3.3.19 Let F : NN > N, a functional of type 2. 
We call F computable if there is an index e such that for all total f : N— N we 
have 


F(f) = 48 (0). 


All computable functionals of type 2 will be continuous with respect to the 
canonical topologies. Kleene extended the notion of relativized computation to 
cover all functionals of any finite type as possible inputs. This will be discussed 
briefly in Chapter 4. 


The second recursion theorem 


We are now in the position to state and prove the second recursion theorem: 


Theorem 3.3.20 Let F(f,x) = ¢f(x) be a computable functional. 
Then there is a computable f such that for all x, 


Moreover, f will be the least such function in the sense that if 
Va(g(x) = F(g, x), 
then f Cg as graphs. 


Proof 

Let fo be the everywhere undefined function, and by recursion let fn41(%) = 
Since fo C fi we see by induction on n that f, C fn+4i for all n. 

Let f =U{fn ; n € N}. By the finite use principle we see that 


F(f,2) =y@ AnF(fn, 2) =y & Infosi(z) =y > f(x) = y. 
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Clearly f is the minimal solution to the equation, the minimal fized point of F. 
It remains to show that f is computable, i.e. that the graph of f is c.e. 

We will use the effective enumeration of all finite partial functions and the 
effective coding of sequences. Let € with indices vary over all finite partial 
functions. Then we can rewrite the following characterization of the graph of f 
int a ©9-form: 


f(x) =yo Ans, cee » En) (V2Eo(z)T 
AVi < n(Wz € dom(§i41) (i41(2) = F(&, z)) A &n(@) = y)). 
Remark 3.3.21 The existence of a computable fixed point for F' follows by the 
recursion theorem. It is possible to show that the index obtained by the proof 


of the recursion theorem will be an index for the least fixed point, see Exercise 
3.20. 


Turing Degrees 
We will now restrict ourselves to total functions. 


Definition 3.3.22 Let f and g be two functions. We say that f is computable 
in g if there is an index e such that for all zx, 


f(x) = $2(2). 
We write f <r g. We will then say thet f is Turing reducible to g. 
The key properties of Turing reducibility is given by 
Lemma 3.3.23 a) <r is a transitive relation. 
b) If f is computable and g is any function, then f <r g. 


c) If f and g are functions, there is a function h such that for any other 
function h’: 


f <rhandg <rh. 
If f <r bh! andg <qrh' thenh <r h’. 


Proof 
a) is a consequence of Lemma 3.3.17, b) is trivial, and to prove c), let 
h(2n) = f(n) and h(2n + 1) = g(n). 


Definition 3.3.24 Let f and g be two functions. f and g are Turing equivalent, 
in symbols f =r g, if f <r g and g <r f. 


=r will be an equivalence relation. The equivalence classes will be called Turing 
degrees or Degrees of unsolvability. We will simply call them degrees. We will 
let bold-face low case letters early in the alphabet , a, b, etc. denote degrees. 
The set of degrees has a canonical ordering < inherited from <r. 

We can summarize what we have observed so far in 
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Lemma 3.3.25 The ordered set of degrees is an upper semi-lattice with a least 
element such that every countable set is bounded, and every initial segment is 
countable. 


We leave the verifications for the reader. We will now see that there is no 
maximal degree: 


Lemma 3.3.26 Leta be a degree. Then there is a degree b such that 
a<b. 


Proof 

Let f € a. Let g(x) = $f (.)(mo(z)) +1 if Of (.)(m2(a))1, otherwise g(x) = 0. 
The proof of the unsolvability of the halting problem can be relativized to f so 
g is not computable in f. On the other hand, clearly f is computable in g. 


The g constructed in the proof above is called f’, the jump of f. The jump 
operator is indeed a degree-operator, see Exercise 3.21. 

We have shown that there are incomparable m-degrees. The same method 
can be used to show that there are incomparable Turing degrees, see Exer- 
cise 3.22. We will prove a stronger result, showing that no strictly increasing 
sequence of degrees will have a least upper bound. 


Theorem 3.3.27 Let {a;}icn be a strictly increasing sequence of degrees. Then 
there are two degrees b and c that are both upper bounds for the sequence, such 
that for any degree d, if d< b andd <c, thend <a; for somei EN. 


A pair b,c as above is called a perfect pair for the sequence. The degrees in 
a perfect pair for a sequence will be incomparable. Further, the existence of a 
perfect pair shows that the sequence will have no least upper bound. 


Proof 

Let {fitien be a sequence of total functions of increasing Turing degrees. We 
will construct the functions g and h as the limits of approximations g, and hz, 
where we in the construction of g,41 and hz+1 want to ensure that if ey = 71(x) 
and e2 = 72(x) and $2, = on, are total, then ¢2, is computable in f,. In order 
to simplify the notation, we let g and h be defined on N?, but this will alter 
nothing. 

In the construction we will preserve the following properties: 


1. Ift <a, then g,(i,n) and h,(i,n) are defined for all n. 

2. Ifi <a, then g,(i,n) = f;(n) for all but finitely many n. 
3. Ifi <a, then hz(t,n) = fi(n) for all but finitely many n. 
4. gx(t,n) is defined only for finitely many (7,n) with x < i. 


5. hz(i,n) is defined only for finitely many (7,n) with x < i. 
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This will ensure that g, and h, are equivalent to f,—1 for x > 0. 
Let go and ho both be the empty function. 
Now, let z > 0 and assume that g, and h, are defined satisfying 1. - 5. above. 
Let e, = 71() and eg = 72(x). What we will do next will depend on the answer 
to the following question: 
Can we find an n and finite extensions g’ of g, and h’ of h, such that 

g’ (n) # ge (n) and both are defined? 
(By a finite extension we mean that we ad a finite number of elements to the do- 
main.) If the answer is ‘no’, we extend gz to gz+1 by letting gz41(x,n) = f2(n) 
whenever this is not in conflict with the construction of g, ( a conflict that can 
appear at at most finitely many places), and we construct hzi1 from hz and f, 
in the same way. 
If the answer is ‘yes’, we first choose two such finite extensions, and then we 
construct g,+1 and h,,, from these extensions as above. This ends our con- 
struction. 

We let g(z,n) = gr4i(x,n) and A(x,n) = hy4i(z,n). In the construction 
we have tried as hard as possible to avoid that ¢2 = oe. The point is that we 
have tried so hard that if they after all turn out to be equal, they will both be 
computable in one of the f;’s. 

Claim 1 

fi; is computable in both g and h. 

Proof 

We have that fi(n) = g(i,n) except for finitely many 7, so f; is computable in 
g. The same argument holds for h. 


Claim 2 

For x > 0 we have that g, and hz, both are computable in f,—1 
Proof 

This is a trivial consequence of properties 1. - 5. 

Claim 3 


If ¢2 = ¢" and both are total, then ¢% is computable in f,, for some z. 

Proof 

Let « = P(e,d) where P is the pairing from Definition 3.2.13. Then in the 
construction of g,+, and h,+1 we ask for a y and finite extensions g’ and h’ of 
gx and h, such that oo (yA oh (y). If we had found some, we would let g and 
h be further extensions of one such pair of finite extensions, and then we would 
have preserved that $2(y) 4 $5(y), contradicting our assumption on e and d. 
On the other hand, for every y we can, by the assumption and the finite use 
principle, find finite extensions such that 62 (y)L and or (y)l. The point is that 
all these values must be equal, otherwise we could have found two extensions 
giving different values. Thus we can give the following algorithm for computing 
$2(y) from g, which again is computable in f,1: Search for any finite extension 
(by searching through the finite partial functions consistent with g,) g’ of gx 
such that ¢% (y)| The value we obtain will be the correct value. 

This ends the proof of our theorem. 
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3.4 A minimal degree 


3.4.1 Trees 


In the constructions of degrees we have performed so far, we have been using 
brute force. For instance, when we want to construct a minimal pair, we start 
with three properties: 


1. f is not computable. 
2. g is not computable 
3. If hk is computable in both f and g, then h is computable. 


We then split these properties into infinite lists of requirements, which we try 
to satisfy during the construction: 


R,,- If ¢- is total, then f ¥ de. 
Rae If de is total, then g F de. 
Rca If f = $4 is total, then $f is computable. 


Now, for any of these requirements and any pair o and 7 of finite sequences 
there will be finite extensions a’ and 7’ such that any further total extension f 
and g of o’ and 7’ resp. will satisfy the requirement. Thus by a step-by-step 
construction we can construct f and g via finite approximations satisfying one 
requirement at the time. The reader is invited to work out the full proof, see 
Exercise 3.23. 

We will now face a problem which we cannot solve by this simple method. 
We will show that there is a non-computable function f such that there is no 
function of complexity strictly between f and the computable functions. Again 
we will set up the relevant properties of f, and fragmentize them into a sequence 
of requirements we want to satisfy during the construction. The problem will be 
that we cannot ensure that these requirements are satisfied by considering just 
a finite approximation of f. Instead we will use trees, and we will satisfy the 
various requirements by insisting that f is a branch in a given binary tree. Before 
we can go into details with the argument, we will reconsider our definition of a 
binary tree, give a formulation that will be handy for this particular application. 
We will not distinguish between finite sequences and sequence numbers here, but 
whenever we say that a function defined from a set of finite sequences to the set 
of finite sequences is computable, we mean that the corresponding function on 
sequence numbers is computable. 


Definition 3.4.1 Let D be the set of finite 0-1-sequences. 


a) Ifo € D we let 0 * 0 and 0 x 1 be o extended by 0 or 1 resp. We extend 
this to the concatenation o * 7 in the canonical way. 


b) If o and 7 are two sequences, and i < lh(a), i < Ih(r) and o(i) € T(t), we 
say that o and 7 are inconsistent. 
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c) f: D— Dis monotone if f(c) is a proper subsequence of f(r) whenever 
o is a proper subsequence of T. 


d) A tree is a monotone function T : D — D mapping inconsistent sequences 
to inconsistent sequences. 


e) If S and T are trees, then S is a subtree of T if there is a tree T’ such that 
S=ToT’. 


f) If T is a tree and f : N — {0,1}, then f is a branch in T if for every n 
there is a sequence o of length n such that T(o) is an initial segment of 


f. 


Remark 3.4.2 These trees will sometimes be called perfect trees, the set of 
branches will form a perfect subset of the set {0,1}‘ in the topological sense, 
i.e. a set that is closed and without isolated points. 

Our intuition should be focused on the set Set(T) of sequences that are initial 
segments of the possible T'(c)’s. This will be a binary tree in the traditional 
sense, and we will have the same set of infinite branches. If S is a subtree of T, 
then Set(S) C Set(L). The converse will not hold in general. 


Lemma 3.4.3 Let {Ti}nen be a sequence of trees such that T,41 is a subtree 
of T, for alln. Then there is a function f that is a branch in all trees Ty. 


Proof 

Consider the set X of o such that o € Set(T;,) for all n. 

X will be a binary tree. The empty sequence is in X. If 0 € X, then for all 
n,o *0 € Set(T,) or 0 ¥ 1 € Set(T,,). At least one of these has to hold for 
infinitely many n and, since we are dealing with subtrees, for all n. Thus X is 
not a finite tree and by K6énig’s Lemma has an infinite branch, which will be a 
common branch for all T;,’s. 


Remark 3.4.4 Using topology we might just say that the intersection of a 
decreasing sequence of nonempty compact sets is nonempty, in order to prove 
this lemma. 


We will now see that certain computable trees can be used to meet natural 
requirements. As a first case, let us prove: 


Lemma 3.4.5 Let T be a computable tree and assume that @. is total. Then 
there is a computable subtree S of T such that @- is not a branch in S. 


Proof 

If ¢e is not a branch in 7’, we can use S = T. If ¢ is a branch in T, one of T(0) 
and T(1) will be inconsistent with ¢., since they are inconsistent themselves. 
(here 0 is the sequence of length 1 with entry 0). Assume that ¢- is inconsistent 
with T(0). 

Let S(c) = T(0*o). Then S is a subtree as desired. The other case is essentially 
the same. 
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3.4.2 Collecting Trees 


Now we will describe a property on computable trees that will ensure that if f 
is a branch in the tree and ¢f is total, then f is computable. 


Definition 3.4.6 Let T be a computable tree, e an index. 
T is e-collecting if for all finite sequences o, 7 and all x € N, if ge (x) and 
oe (x), then 


ge (x) = G2 (2). 


Lemma 3.4.7 Let T be a computable e-collecting tree and let f be a branch in 
T. If of is total, then of is computable. 


Proof 

We will give an algorithm for computing ¢/ (x) from 2. 

Since ¢f(x)|, there will be a 0-1-sequence o such that ge (x) |, and since T’ 
is e-collecting, the value on” (x) will be independent of the choice of o. Thus 
our algorithm will be: 

Search for a finite sequence o such that be (a) | and let the answer be the 
output of our algorithm. 


Remark 3.4.8 There is more information to be gained from this proof. We 
see that the function @/ itself is independent of f as long as it is total and f is 
a branch in an e-collecting tree. 


3.4.3 Splitting Trees 


We will now find a criterion that will ensure that f is computable in ¢f whenever 
f is a branch in a computable tree and $f is total: 


Definition 3.4.9 Let T be a computable tree, and let e be an index. 
We call T e-splitting if for all finite 0-l-sequences o and 7, if o and 7 are 
inconsistent, then there exists a number x such that 


ge (x)| , 62 (x) | with 62° (x) ¢ 2 (2). 


Lemma 3.4.10 Let T be an e-splitting computable tree , let f be a branch in 
T and assume that of is total. Then f is computable in of. 


Proof 
We will compute an infinite 0-1-sequence {k;};en from ¢f such that T(o,) is an 
initial segment of f for all n, where o,, = (ko,...,kn—-1). The empty sequence 


do of course satisfies this. Assume that o,, is constructed. Then one of T'(o,, *0) 
and T(o, * 1) will be an initial segment of f. We will just have to determine 
which one. Now $2°7"*)) and ¢2'7"*") will be inconsistent, so exactly one of 
them will be inconsistent with ¢/. We can then use ¢/ to find the inconsistent 
one, which means that we can decide which direction is along f and which is 
not. This provides us with the induction step, and we can move on. This ends 


the proof of the lemma. 
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Remark 3.4.11 In the case of T being an e-splitting tree, we see that of is a 
one-to one-function of the branch f, and what we just argued for is that we can 
compute the inverse. 


3.4.4 A minimal degree 


Using Lemmas 3.4.3, 3.4.5, 3.4.7 and 3.4.10 we can show the existence of a 
minimal degree from the following: 


Lemma 3.4.12 Let T be a computable tree and let e be an index. Then there 
is a computable subtree S that is either e-collecting or e-splitting. 


Proof 

Case 1: There is a sequence o such that for all 7 and 7’ extending o, oe” and 
ge) are equal where both are defined. Let S(7) =T(o*7). Then S will bea 
subtree of T and S will be e-collecting. 

Case 2: Otherwise. Then for every o there will be extensions 7) and 7; such 
that ge (7) and ge ™) are inconsistent. Further, we can find these 79 and 7) as 
computable functions to(o) and t1(c). 

We then define the subtree S by 


S(()) =T(Q), ie. S and T are equal on the empty sequence. 
If S(c) is defined, let S(o « 0) = T(to(c)) and S(a * 1) = T(ti(o)). 
This defines a subtree that will be e-splitting. 


We have now proved all essential steps needed in the construction of a min- 
imal degree: 


Theorem 3.4.13 There is a non-computable function f such that if g <r f 
then either g is computable or g is equivalent to f. 


Proof 
Using the lemmas above we construct a family {T,,} of computable trees such 
that T,+1 is a subtree of T,, for all n, and such that for all e: 


If ¢- is total, then ¢, is not a branch in To.41. 
Toe+2 is either e-collecting or e-splitting. 


Then by Lemma 3.4.3 there is an f that is a branch in all T,,’s, and this f will 
have the property wanted. 
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3.5 A priority argument 


3.5.1 C.e. degrees 


In the constructions of minimal pairs and functions of minimal degrees we have 
not been concerned with the complexity of the sets and functions constructed. 
We can decide upper bounds on the complexity of the objects constructed in the 
proofs by analyzing the complexity of properties like ‘dé, is total’ and counting 
in depth how many number quantifiers we will need in order to write out a 
definition of the object constructed. If we, however, are interested in results 
about degrees with some bounded complexity, we must be more careful in our 
constructions. In this section we will be interested in degrees with at least one 
c.e. set in it: 


Definition 3.5.1 Let a be a degree. 
ais anc.e. degree if a contains the characteristic function of a c.e. set. We say 
that f is of c.e. degree if the degree of f is a c.e. degree. 


There is a nice characterization of the functions of c.e. degree. We leave the 
proof as an exercise for the reader, see Exercise 3.26. 


Theorem 3.5.2 Let f be a function. Then the following are equivalent: 
i) f is of c.e. degree. 


ii) There is a primitive recursive sequence {fi}ien converging pointwise to f 
such that the following function 


g(x) = pnNm > n(fm(x) = f(z) 


is computable in f. 


3.5.2 Post’s Problem 


So far we only know two c.e. degrees, O, the degree of the computable sets and 
functions, and O’, the degree of the halting problem or of K. Post’s Problem 
asks if there are more c.e. degrees than those two. This is of course a nice, 
technical problem, but it has implications beyond that. One of the reasons 
why c.e. sets are so interesting is that the set of theorems in an axiomatizable 
theory is c.e. If there were no more c.e. degrees than those two known to us, a 
consequence would be that there are two kinds of axiomatizable theories, those 
that are decidable and those that share the complexity of Peano Arithmetic. 
As a consequence of Gédel’s proof of the incompleteness theorem, the set of 
Gédel-numbers of theorems in Peano Arithmetic is a complete c.e. set, and 
actually of even the same m-degre as K. 

Now, in 1957 two young mathematicians, Friedberg and Muchnic, indepen- 
dently constructed c.e. sets of in-between degrees. They both developed what 
is now known as the priority method. The problem we have to face when con- 
structing c.e. sets is that we must give an algorithm for adding elements to the 
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set, but we cannot give an algorithm for keeping objects out of the set. If we 
did that, the set constructed would become computable. Thus when we have 
made an attempt to approximate a set with positive and negative information, 
we must be allowed to violate the negative information. However, we must 
not violate the negative information to such an extent that we ruin our global 
goal. We solve this dilemma by introducing priorities to our requirements. If 
an effort to satisfy one requirement will ruin the attempt to satisfy another re- 
quirement, we let the requirement with highest priority win. This idea will work 
when two properties are satisfied by the construction: If we make an attempt 
to satisfy a requirement and we never ruin this attempt, we actually manage to 
satisfy the requirement. Further, if we after a stage in the construction never 
make an attempt to satisfy a requirement, the requirement will automatically 
be satisfied. 

Thus we are bound to satisfy the requirement of highest priority, either 
because we make an attempt which will not be ruined, or because there is no 
need to make an attempt. 

Then we are bound to satisfy the next requirement, either because we make an 
attempt after the final attempt for the first requirement, or because there is no 
need to make such an attempt, 

and so on.... In the finite injury lemma we will give a full proof along this line 
of arguing. 


3.5.3 Two incomparable c.e. degrees 


Post’s problem was solved by constructing two incomparable c.e. degrees a and 
b. We will see below why this actually solves the original problem. 


Theorem 3.5.3 There are two c.e. sets A and B that are not computable 
relative to each other. 


Remark 3.5.4 If A and B are not computable in each other, neither can be 
computable, because any computable set will be computable in any set. More- 
over neither can have the same degree as K, because every c.e. set is computable 
in K. Thus we have not just produced an in-between degree, but two in-between 
degrees. In Exercise 3.27 we will see that there are infinitely many in-between 
c.e. degrees, and that any countable partial ordering can be embedded into the 
ordering of the c.e. degrees. 


Proof 
We will construct two c.e. sets A and B satisfying 


Ro: N\AAWS 
Ree41t N\B x wa 


or in other terms: The complement of A is not c.e. relative to B and vice versa. 
If we achieve this for all e, we will have proved the theorem, since a set A is 
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computable in B if and only if both A and the complement of A are c.e. in 
B, and since A is c.e. we have that A is computable in B if and only if the 
complement of A is c.e. in B. 

We will construct two primitive recursive increasing sequences {A;,}nen and 
{Bn}nen of finite sets . We let 


Ap = Bo = 0. 


We call each step in the process a stage. If n = (1, e, 2) we will consider to make 
an attempt to satisfy Roe at stage n, while if n = (2,e,2) we will consider to 
make an attempt to satisfy Roe+1. 

An attempt to satisfy Ro. will consist of selecting aq © Anil Weet*, and 
then put up a protection, the set of points used negatively in the verification of 
qe Wer. If we can keep all objects in this protection out of B throughout 


the construction, we will have 
qe ANWP 


and Ro- will be satisfied. We call the protection active at a later stage m if By, 
is disjoint from this protection. 

There is some little minor trick to observe, we will use disjoint infinite sup- 
plies of numbers that we may put into A (or B) in order to satisfy Ro. (or 
Roe+1). We will use this to show that if we make only finitely many attempts, 
we will succeed after all. 


Now let n = (1,e,2) and assume that A, and B,, are constructed. Assume 
further that we constructed certain protections, some of them active at stage n, 
others not. We write the following procedure for what to do next: 

Let Bnii = Bn. 

Question 1: Is there a protection for R2e active at stage n? 

If the answer is ‘yes’, let An4i = An. and continue to the next stage. 

If the answer is ‘no’, ask 

Question 2: Is there a y < n such that ¢P»((y,e)) | and y is in no active 
protection for any requirement Roq41 where 2d +1 < 2e? 

If the answer is ‘no’, let A,41; = A, and proceed to the next stage. 

If the answer is ‘yes’, choose the least y, let An41 = An U {(y,e)}, construct a 
protection {0,...,n}\ B, for Ro_ and move on to the next stage. 


If n = (2,e,x) we act in the symmetric way, while for other n we just move on 
to the next stage, not adding anything to A or B. 
This ends the construction. 


Claim 1 (The Finite Injury Lemma) 

For each requirement R, there is a stage n, after which we do not put up or 
injure any protection for R,. 

Proof 

We prove this by induction on s, and as an induction hypothesis we may assume 
that there is a stage ms after which we never put up a protection for any 
requirement R; with t < s. Then after stage m, we will never injure a protection 
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for R,. Thus if we never put up a protection for R, after stage m, we can let 
Ns = ms, while if we construct a protection, this will never be injured and we 
can let n, be the stage where this protection is constructed. 


Now let A = Unen An and B= Unen Bn. Then A and B are c.e. sets. 
Claim 2 

Each requirement R, will be satisfied. 

Proof 


We prove this for s = 2e. There are two cases. 
1. There is a protection for R, active at stage ns. 
If this is the case, there will be a y such that (y,e) € An, A W232". Since the 
protection is not injured, we will have that (y,e) € ANW2 and the requirement 
is satisfied, A and W2 are not complementary. 
2. There is no such protection. 
There are only finitely many objects of the form (y,e) in A, because we ad at 
most one such object for each stage before n,, and never any at a stage after 
Ns 
On the other hand, there can be only finitely many objects of the form (y, s) in 
W.2, since otherwise we could choose one that is not in any protection for any 
R, for t < s, and sooner or later we would at some stage after n, make a new 
attempt to satisfy R,, which we are not. Thus 4U W8 contains only finitely 
many objects of the form (y,e) and the sets are not the complements of each 
other. Thus the requirement will be satisfied in this case as well. 

We have shown that all the requirements are satisfied in this construction, 
so the theorem is proved. 


3.6 Subrecursion theory 


3.6.1 Complexity 


What is to be considered as complex will be a matter of taste. Actually, the 
same logician may alter her/his taste for complexity several times a day. The 
generic logician may give a class on automata theory in the morning, and then 
the regular languages will be the simple ones, while context free languages are 
more complex. Still, context free languages are decidable in polynomial time, 
and while our logician spends an hour contemplating on the P = NP-problem 
any context free language is by far simpler than the satisfiability problem for 
propositional logic. If our logician is working mainly in classical computability 
theory, all decidable languages are simple, while the undecidable ones are the 
complex ones. If, however, our logician is a set theorist, all definable sets are 
simple, maybe all subsets of subsets of subsets of R are simple, we have to move 
high up in cardinality in order to find sets of a challenging and interesting com- 
plexity. 

In this section, our view on complexity will be one shared by many proof theo- 
rists. One of the aims in proof theory is to characterize the functions and sets 
provably computable in certain formal theories extending elementary number 
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theory. The idea is that if we know the functions provably computable in T,, we 
know something worth knowing about the strength of the theory T. 

We will not be concerned with proof theory in this compendium, and any 
references to proof-theoretical results should not be considered as a part of any 
curriculum based on this text. 


3.6.2 Ackermann revisited 


In Section 3.2.1 we defined the Ackermann branches. The idea of Ackermann 
was that each use of the scheme for primitive recursion involves an iteration of 
a previously defined function. Then diagonalising over a sequence of functions 
defined by iterated iteration would break out of the class of primitive recursive 
functions. 

The Ackermann branches were defined using functions of two variables. We 
will be interested in pushing his construction through to the transfinite level, 
in order to describe more complex functions from below. Then it will be conve- 
nient, from a notational point of view, to use functions of one variable. 


Definition 3.6.1 
e Let Fo(x) =a+4+1 


e Let Frii(z) = Fit? (2) 


In Exercise 3.30 we will see that diagonalising over the F),’s will lead us 
outside the class of primitive recursive functions. 


From now on in this section we will let PA be first order Peano arithmetic; 
i.e. elementary number theory with the axiom scheme for first order induction. 
Our first order language L will be the language of PA. 


Definition 3.6.2 A function f : N — N is provably computable if there is a 
formula A(x, y) = 4zB(a,y,z) in L where A defines the graph of f, B has only 
bounded quantifiers and 


PA VasyA(a, y). 


This definition is extended in the obvious way to functions of several variables. 


We will assume that the reader is familiar with the method of arithmetisa- 
tion, sequence numbering and so forth. 


Lemma 3.6.3 Let f(k,x) = Fy(x). Then f is provably computable. 


Outline of proof 

Essentially we have to prove that Fo is total and that if F;, is total then Fy+1 
is total as well. This induction step is proved by induction, where we actually 
must prove 


VaVyszF? («) = z 


by induction on y. 
In order to write a complete proof, we have to write down the formal definition 
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of the graph of F and the establish a proof tree for the formula required to be 
a theorem in PA. This is tedious and not very exiting; the exiting part is to 
decide how much induction that is required. 


3.6.3 Ordinal notation 


The concept of an ordinal number will be properly introduced in a course on 
set theory. We have defined the concept of well orderings. An ordinal number 
will be a set that in a canonical way represents an isomorphism class of well 
orderings. For our purposes it will be sufficient to think about order types of 
well orderings, and we will use a term language for such order types. Exercise 
5.11 is a self-service introduction to ordinal numbers. 


Definition 3.6.4 w is the smallest infinite ordinal number, and it denotes the 
order-type of N with the standard ordering. 


All natural numbers will be ordinals numbers as well. 

We may extend the arithmetical operations plus, times and exponentiation to 
operations on orderings, or actually, on order types. The sum of two orderings 
(A, <4) and (B,<g) will be the set 


C=A®B= ({0} x A) U ({1} x B) 
with the ordering <c defined by 
e (0,a) <c (1,b) whenever a € A and be B. 
e (0,41) <c (0, a2) if and only if ay <4 ae. 
e (1,b1) <c (1, be) if and only if b) <g be. 


The product of two orderings will for our purposes be the anti-lexicographical 
ordering on the product of the domains. 


The formal definition of the exponential of orderings is less intuitive, but it helps 
to think of exponentiation as iterated multiplication. Our definition only works 
for special orderings (A, <,), including all well orderings. 


Definition 3.6.5 Let (A,<,4) and (B,<g) be two orderings where A has a 
least element ag. 

Let p € C if p is a map from B to A such that p(b) = ao for all but finitely 
many be B. 

If p # q are in C, there will be a maximal argument b such that p(b) 4 q(b). 
We then let p <c ¢ = p(b) < q(b) for this maximal b. 


Lemma 3.6.6 If (A,<,) and (B,<g) are two well orderings then the sum, 
product and exponential will be well orderings. 
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The proof is left as Exercise 3.31 


As a consequence, every arithmetical expression in the constant w and con- 
stants for the natural numbers, and using ’plus’, ’times’ and ’exponents’, will 
have an interpretation as an ordinal number. 

The least ordinal number that cannot be described by an expression as above 
is baptized €y. As for ordinary arithmetics, w° = 1. This is a special case of the 


more general rule 
we wh = wrt? 


A consequence is that each ordinal a < €9 can be written in a unique way as 
= Wr Hoe MH 


where {Q0,:--,Qn} is an increasing (not necessarily strictly) sequence of ordi- 
nals less than a. We call this the Cantor Normal Form of a. If ag = 0 the 
ordinal a@ will be a successor ordinal, otherwise it will be a limit ordinal. 


We extended the Ackermann hierarchy to the first transfinite level by diago- 
nalising over the Ackermann branches. One advantage with the Cantor normal 
form is that we may find a canonical increasing unbounded sequence below ev- 
ery limit ordinal between w and €9. Actually, it is possible to do so for ordinals 
greater than €9 too, but readers interested in how this is done and why someone 
would like to do it are recommended to follow a special course on proof theory 
and ordinal denotations. 


Definition 3.6.7 Let 
a= wm tere + yy 


be given on Cantor normal form, and assume that ag > 0. 
We define the n’th element a[n] of the fundamental sequence for a as follows: 


Case 1 an = G+1: 
Let a[n] = wm +---+w% +n-wh, 


Case 2 ap is a limit ordinal: 
We may then assume that ao[n] is defined, and we let 


We may consider the map a+ a[n] as an extension of the predecessor function 
to limit ordinals: 


Definition 3.6.8 Let 0 <a < e. We define the n-predecessor of a by 


a) If a is a successor ordinal, the predecessor of a will be the n-predecessor 
of a. 


b) If a is a limit ordinal, a[n] will be the n-predecessor of a. 
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c) We say that @ <, a if 6 can be reached from a by iterating the n- 
predecessor map. 


Lemma 3.6.9 Let a <€0, BG <maandm<n. Then B <n a. 


Proof 
It is sufficient to prove that if m < n and a is a limit ordinal, then alm] <,, a. 
This is left as a non-trivial exercise for the reader, see Exercise 5.7. 


Lemma 3.6.10 Leta < €9 and let G@ < a. Then there is ann such that 3 <n a. 


Proof 
This is proved by induction on a with the aid of Lemma 3.6.9. The details are 
left as a nontrivial exercise for the reader, see Exercise 5.7 


3.6.4 A subrecursive hierarchy 


We will now extend the alternative Ackermann hierarchy to all ordinals less 
than €0: 


Definition 3.6.11 Let a < «9. We define Fy (x) by recursion on a as follows: 
e Fo(xz)=x2+1 
© Foyi(0) = Fe**(c) 
e Fy (#2) = Foje|(@) when a is a limit ordinal. 


Proof theorists have shown that if a function f : N — N is provably computable, 
then f will be bounded almost everywhere by one of the Fy’s where a < €p. 
The proof involves the translation of a proof in PA to a proof in w-logic, then 
a cut-elimination procedure in w-logic and finally an analysis of cut-free proofs 
in w-logic of the totality of computable functions. This analysis is far beyond 
the scope of this compendium, and the reader is not recommended to work out 
the details. 

In Exercises 5.7 and 5.8, the reader is challenged to prove that each Fy is 
provably computable and establish the hierarchical properties of the {Fa}a<eg- 
hierarchy. The main results are: 


Lemma 3.6.12 Letn <x andB<,a< oe. 
Then 
F(a) < Fa(2). 


Theorem 3.6.13 If 8 <a< €o, then 
daoVe > to(F a(x) < Fy(x)). 


Remark 3.6.14 There are many aspects about subrecursive hierarchies that 
we have not discussed in this section. We have not discussed complexity classes. 
For instance, the class Hy of functions computable in polynomial time relative 
to a finite iteration of fF represents a stratification of the set of all provably 
computable functions into a complexity hierarchy, where each complexity class 
is closed under composition and closed under polynomial time reductions. We 
will not discuss such matters further here. 
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3.7 Exercises 


Exercise 3.1 Prove that the following functions are primitive recursive: 


a) f(z,y,Z)=at+y 
b) f(a,y,Z)=a-y 
c) f(x,y, 2) = 24 


e) f(z,¥) =2-1 

f) f(x,y,Z)=2-y 

&) f(t,y,7) =0ife=y 
f(z,y,2) =lifefy 

h) f(a,y,2) =Oifa<y 


f(z,y,Z)=life>y 


Exercise 3.2 Prove the following facts: 


a) N” and @ are primitive recursive as subsets og N”. 


b) The complement of a primitive recursive set is primitive recursive. More- 
over, the union and intersection of two primitive recursive subsets of N” 
will be primitive recursive. 


Exercise 3.3. a) Prove Lemma 3.2.6. 


b) Prove that we can replace the inequalities by strict inequalities in Lemma 


3.2.6. 


Exercise 3.4 Prove Lemma 3.2.7. 


Exercise 3.5 a) Prove Lemma 3.2.11. 


b) Prove that the sequence numbering is monotone in each coordinate. 


c) Prove that the monotone enumeration SEQ of the sequence numbers is 


primitive recursive. 


Hint: Find a primitive recursive bound for the next sequence number and 


use bounded search. 


d) Define an alternative sequence numbering as follows: 
((£0,+++;%n—1)) is the number z such that 


SEQ(z) = (xo0,---,2n-1)- 


Show that this alternative numbering is surjective and still satisfies Lemma 


3.2.11 
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e) Prove that the pairing function P in Definition 3.2.13 is 1-1 and onto. 


Exercise 3.6 Let X be a set of functions closed under the schemes of primitive 
recursion. 
Show that for any function f : N— N we have 


fEXSfex. 


Exercise 3.7 We define the function f(k,e,y) by recursion on k and subrecur- 
sion on ¢ as follows: 


1. For all k, if e = (1) and y = (a, 21,...,2n) we let 
f(k,e,y) =ac4+1. 
2. For all k, if e = (2,7), y = (@1,...,%n) and 1 <i <n we let 
f(k,e,y) = x. 
3. For all k, if e = (3,q) we let 
fk, ey) = 4. 
4. For all k, if e = (4,e’,d1,...,dn) we let 
f(k,e,y) = fk. e', (f(k, di, y),-++£(K, dns). 
5. For all k > 0, if e = (5, e1, e2) we let 


* f(k,e, (0,21,...,2n)) = f(k — 1,e1, (11,...,2n)) 
** f(k,e,(m+1,21,..-;2n)) = 
f(k— 1, e2,(f(k,e, (m, 20,-..,2%n),™M,21,-..,Ln)) 


6. In all other cases, we let f(k,e, y) = 0. 
a) Prove that f is well defined, and that f is computable. 


b) Prove that if g of arity n is primitive recursive, there is a number k and 
an index e such that 


g(X1, tee , Ln) = f (ke, (x1, te ,2n)) 
for all 71,...,2%n € N”. 
c) Prove that f is not primitive recursive. 


Exercise 3.8 Prove that if ¢.(%) = y and ¢,(Z) = z, then y = z. 
Hint: Use induction on e. 
Discuss why this is something that needs a proof. 
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Exercise 3.9 Let {fn}nen be a sequence of total functions such that 


g(n,m) = fr(m) 


is computable. 

Show that each f, is computable, and that there is a total computable function 
not in the sequence. 

Hint: Use a diagonal argument. 


Exercise 3.10 Show that there is a total computable function ®(n, x) of two 
variables that enumerates all primitive recursive functions of one variable. 
Is it possible to let ® be primitive recursive? 


Exercise 3.11 Complete the proof of Lemma 3.2.27. 
Exercise 3.12 Prove Corollary 3.2.32. 


Exercise 3.13 a) Prove that every non-empty c.e. set is the image of a 
primitive recursive function (easy) and that every infinite c.e. set is the 
image of an injective computable function (not that easy, but still...). 


b) Prove that the range of a strictly increasing total computable function is 
computable. 


Exercise 3.14 Let A C N”. Show that A is c.e. (by characterization ii) or iii) 
in Theorem 3.2.35) if and only if 


AG bay (apie ta) se At 
is C.e. 


Exercise 3.15 Give an explicit description of the selection function in the proof 
of Theorem 3.2.39. 


Exercise 3.16 Let A and B be two disjoint sets whose complements are c.e. 
Show that there is a computable set C such that AC C and BNC =9. 
Hint: Use Corollary 3.2.40. 


Exercise 3.17 Let L be the language of propositional calculus over an infinite 
set {A;}ien of propositional variables. Discuss the following statement: 

There is a primitive recursive consistent set of propositions with no computable 
completion. 


Exercise 3.18 a) Show that there is an enumeration {In}ien of all closed 
rational intervals contained in [0,1] such that the relations I, C Im, InN 
Im = and |L,| < 27” are computable, where |J,,| is the length of the 
interval. 


A real number x is computable if there is a computable function h such 
that 
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i) |In(ny| < 27” 
ii) For all n, x € Incn) 


b) Let f : [0,1] — [0,1] be a continuous function. 
We say that f is computable if there is a total computable function g : 
N — N such that 
i) In © Im = Lg(ny S Lg(m) 
ii) rel,=> f(z) = Ig(n) 


iii) For all x and m there is an n with x € I, and |Ig(n)| < 27™. 


Show that any computable function will be continuous. Show that there 
is a computable function that does not take its maximal value at any 
computable real. 


Exercise 3.19 Show that if A =,, B and C =,, D, then AGC =, BO D. 
Show that if A <,, FE and B <,, E then A@ B <,, E. Show that A® B then 
represents the least upper bound of A and B in the m-degrees. 


Exercise 3.20 Show that the second recursion theorem can be extracted from 
the proof of the recursion theorem. 
Hint: Let F'(g,xz) be computable. Use the recursion theorem on 


f(e, 2) =e F(¢¢, 2). 


Let eg be the index obtained from the proof of the recursion theorem. 
Show that if ¢.,(a) = y and we make an oracle call for ¢., (z) in the computation 
of F(¢.,,x) then ¢.,(z) is a subcomputation of ¢¢, (2). 


Exercise 3.21 Show that if f; = fo, then the jumps fy; and f% are also Turing 
equivalent. 


Exercise 3.22 Make a direct construction and show that there are two total 
functions f and g such that f <r g and g &r f. 


Exercise 3.23 Prove that there is a minimal pair of Turing degrees, i.e. a pair 
{a,b} of degrees of non-computable functions, such that the degree O of com- 
putable functions is the greatest lower bound of a and b. 

Hint: You may use the main idea in the proof of Theorem 3.3.27, the present 
theorem is just simpler to prove. You may also get some ideas from the discus- 
sion of this proof in the text. 


Exercise 3.24 Let O be the degree of the computable functions. 
Recall the definition of the jump operator in the proof of Lemma 3.3.26, cfr. 
Exercise 3.21. We define the arithmetical hierarchy as follows: 


A X9-set is a c.e. set (of any dimension). 
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A II9-set is the complement of a 9-set 
A x2. ,-set is the projection of a II}-set 
A II}, ;-set is the complement of a ©}, ;-set 
A A? set is a set that is both ©? and HP. 
Let O\) be the degree obtained from O using the jump-operator n times. 


a) Prove that if A is ©? or A is IT?, then (the characteristic function of) A 
has a degree a < oO), 


b) Show that for k > 1 we have that A is D4 if and only if the degree of A 
is bounded by O), 
Hint: Use the relativized version of the fact that a set is computable if 
and only if both the set and its complement are c.e. 


Exercise 3.25 Show that there are continuumly many minimal degrees. 

Hint: Instead of constructing one tree T;, at level n we might construct 2” trees 
Ton for [h(o) =n, ensuring for each e that the branches of different trees will 
not be computable in each other via index e. The proof requires some clever 
book-keeping. 


Exercise 3.26 Prove Theorem 3.5.2. 


Exercise 3.27 Let B C N? beaset. For eachn €N, we let B, = {m; (n,m) € 
B} and we let B_, = {(k,m)€ B; k#n}. 


a) Show that there is a c.e. set B such that for all n, B, is not computable 
in B_y. 


Hint: Use an enumeration of N? to give all the requirements 
Rage NV oe we 
a priority rank. 


b) Consider a computable partial ordering < on the natural numbers. 
Show that there is an order-preserving map of < into the c.e. degrees. 
Hint: Use the construction in a), and let 
Cr ={(k,m) € B; k xn}. 


There is one computable partial ordering < such that any other partial ordering 
of any countable set can be embedded into <, see Exercise 5.2. Thus this shows 
that any countable partial ordering can be embedded into the ordering of the 
c.e. degrees. 


Exercise 3.28 Fill in the details in the proof of the following theorem: 


Theorem 3.7.1 Leta >O be anc.e. degree. Then there are two incomparable 
c.e. degrees b and c such thata=b@c. 
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This theorem is called The splitting theorem. We split the c.e. degree a into 
two simpler c.e. degrees. 

Proof 

Let A be a non-computable c.e. set. It is sufficient to construct two disjoint c.e. 
sets B and C such that 

A= BUC, A is not computable in B and A is not computable in C. 

Let f be a 1-1 enumeration of A. At each stage n we will put f(n) into B or 
f(n) into C, but not into both. 

Let B, be the numbers put int B before stage n and C,, be the set of numbers 
put into C before stage n. Further, we let 

An = {f(0),...,f(m—1)}, so An = Bn UC). 

We put up requirements 


Bogs Ky oe. 
Ree41 < Ka # bf. 


which we give priorities in the usual way. 
For each requirement R, we define three auxiliary functions. For s = 2e they 
will be: 


The match function 


m(s,n) = pk <nVa < k(¢2n (x) = Ky4,,(2)). 


The bar function 


b(s,n) = max{m(s,n’); n' <n}. 


The protection function 
p(s,n) = {y ; y is used negatively in computing b0n (a) for some x < 
b(s,n)}. 


In this case we call this a protection of B. 


Now the construction at stage n is as follows: If f(n) ¢ p(s, n) for any s < n, put 

f(n) into B. Otherwise, consider the requirement R, of highest priority such 

that f(n) € p(s,n). If this is a protection of B, we put f(n) into C, otherwise 

we put f(n) into B. 

When we put an element into a protection of B we injure that requirement. We 

will prove that for any requirement R, there is a stage n, after which we will 

never injure that requirement, and simultaneously that the bar-function b(s, 7) 

is bounded when s is fixed. 

Assume that this holds for all s’ < s. Then there is a stage n, after which f(n) 

is not in the protection for any s’ < s, and then, after stage n,, R, will not be 

injured. 

This in turn means that if x < 0(s,n) forn > ng and $2n(x)|, then $2 (x) = 
em (2). 

Now, if limy—o0 v(s,) = co we can use the increasing matching and the stability 
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of one to show that A is computable, which it is not. 

On the other hand, if K, = ¢? we will get increasing matching. Thus at 
the same time we prove that the construction of the bar and protection for R, 
terminates and that the requirement is satisfied at the end. 


Exercise 3.29 Post hoped to prove that a simple set cannot be of the same 
degree as the complete c.e. set K. This will not be the case, which will be clear 
when you have solved this problem. 
Let A be a non-computable c.e. set and f a total computable 1-1-enumeration 
of A. We know that f cannot be increasing (why’). 
Let 
B={n; 


m>n(f(m) < f(n))} 


This is called the deficiency set of the enumeration. 
a) Show that B is c.e. and that B is computable n A. 


b) Show that A is computable in B. 
Hint: In order to determine if x € A it is sufficient to find n ¢ B such 
that f(n) > a. 


c) Show that B is simple. 
Hint: If the complement of B contains an infinite c.e. set, the algorithm for 
computing A from B in b) can be turned into an algorithm for computing 


A. 


Exercise 3.30 Let F(k, x) = Fy(x) be the alternative Ackermann function. 
a) Show that « < F(k,x) for all k and x. 
b) Show that F is monotone in both variables. 


c) Let f : N” > N be primitive recursive. 
For % = (#1,...,%m), let SO># = v1 +--+ +2. Show that there is a 


number k such that 
f(@) < Fk, >, 2%) 


for all Ee N™. 

Hint: Use induction on the construction of f. 

In the cases f(x, #) = a%+1 and f(Z) = 2; you may use k = 0. 

In the case f(Z) = q, use a k such that F(k,0) > gq. 

In the case of composition, you may find it convenient to increase k by 
more than one, while in the case of primitive recursion, you should increase 
k by exactly one. 


d) Show that F is not primitive recursive. 
Hint: Show that G(k) = F(k,k) +1 cannot be primitive recursive, using 


c). 


Exercise 3.31 Prove Lemma 3.6.6 
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Exercise 3.32 Let P be the set of polynomials P(x) where we only use + (not 
*minus’) and where all coefficients are natural numbers. 


We order P by 


P(2) X Q(z) = AnVm > n(P(m) < Q(m)). 


Show that ~< is a well ordering of order-type w”. 
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Chapter 4 


Generalized Computability 
Theory 


4.1 Computing with function arguments 


When we introduced the Turing degrees, we first introduced the notion of rela- 
tivized computations via the notation 


bff (a4,...,¢m). 
By a small change of notation, we may view this as a partial functional 


O23 6585 Bip fis esd) 


where some of the inputs may be numbers and some may be functions. 


Definition 4.1.1 Let F : N™ x (NX)" SN. 
We say that F' is computable if there is an index e such that for all fe (NN)” 
and f € N™ we have that 


= 


F(a, f) = be(&, f). 
We will concentrate our attention to computable functionals F : NN = N. 


Let us first see why we may expect to face all relevant theoretical problems even 
then. 


Lemma 4.1.2 Let n > 0 and m > 0. Then there is a computable bijection 
between N™ x (NN)" and NN. 


Proof 

We will produce the bijections between (N%)? and NN and between N x NN and 
NN. The rest then follows by iterating the two constructions. 

Let (f,g)(x) = (f(x), g(x)) where we use a computable pairing function with 
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computable projections on N. 
Clearly (f,g) is computable from f and g in the sense that 


F(a, f,9) = (f,9)(2) 


is computable. In the same sense, f and g will be computable from (f, 9). 
If f ¢ NN anda EN, let 


(x, f)(0) =2@ 
(x, f)(y+1) = fly) 


This clearly defines a bijection, and it is computable in the sense above. 


4.1.1 Topology 


All these spaces are actually topological spaces, even metrizable spaces. The 
topology on N will be the discrete topology, and the natural metric will be the 
one where distinct numbers have distance 1. 

The topology on NN will be the standard product topology. Since there may be 
readers not familiar with the product topology, we give a direct definition. 


Definition 4.1.3 Let O C NN. O will be open if whenever f € O there is a 
number n such that for all g : N — N we have that g € O whenever f and g 
agrees for all arguments between and including 0 and n — 1. 


In more technical terms, this can be expressed by 


fEO=> AnVo(f(n) = G(n) > g € O). 


If o is a finite sequence of numbers, o determines an open neighborhood 
Bo ={f EN; f(IA()) =} 


i.e. the set of functions f extending o. These sets B, will form a basis for the 
topology. 


There is a close connection between topology and computability. There are 
many examples of this in the literature. Although the use of topology is re- 
stricted to fairly elementary general topology, it is very hard to read the current 
literature on computability on non-discrete structures without any knowledge 
of topology at all. In this section we will give one example of this connection. 


Theorem 4.1.4 Let F: NN —N. Then the following are equivalent: 
a) F is continuous. 


b) There is a function f : NN such that F is computable relative to f. 
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Proof 
First let F be computable relative to f, i.e. there is an index e such that 


F(g) = ¢(f,9) 


for all g. 

For a fixed g, the computation tree of ¢.(f,g) is finite, and thus application of g 
will only be used a finite number of times in the computation tree. Let n be so 
large that if g(x) occurs in the computation tree, then « <n. As a consequence 
we see that if h(n) = g(n) then the computation trees of de(f,g) and de(f,h) 
will be identical. The further consequence is that 


vh € N*(R(n) = g(n) > F(g) = F(A) 


and this just means that F’ is continuous. 

Now, let us prove the converse, and let F' be continuous. Let {on }nen be a 
computable enumeration of all finite sequences. Let X = {n; F' is constant on 
B,,,}. Let f be defined by 

f(n) =0ifn¢g X 


f(n) =m+1if F is constant m on B,,. 


Then for each g € N — N we have that 
F(g) = f(un.f(n) >0Aon <g)-1 (4.1) 


where o < g means that the finite sequence g is an initial segment of the infinite 
sequence g. 


4.1.2 Associates 


In the proof of Theorem 4.1.4 we constructed a function f such that f(n) > 0 
if and only if F is constant on B,,, and in this case, f(n) = F(g) +1 for all 
g € B,,,. Following Kleene, we call this f the principal associate of F'. This is 
of course an important concept, but from a computability theory point of view 
it is not completely satisfactory: 


Lemma 4.1.5 There is a computable functional F such that the principal as- 
sociate is not computable. 


Proof 
Let A= W. be c.e. but not computable, i.e. 


né€ AS IAmT(e,n,m) 


where T' is Kleene’s T-predicate. 
Define the computable F’ by 


e F(g) =O if dm < g(1)T(e, g(0),m) 
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e F(g) =1 otherwise. 


Let f be the principal associate of F’. Let v(n) = (n), i.e. the sequence number 
of the one-point sequence containing just n. F will be constant on By, if and 
only if n ¢ A, and then the constant value is 1, so we have 


n¢As f(v(n)) =2. 


Since v is computable and A is not computable, f cannot be computable. 


When we showed that any functional F computable in f will be continuous, 
we referred to the computation tree of ¢.(f,g). When we look at the example 
showing Lemma 4.1.5 we see that the computation tree for F(g) always will 
make use of g(0) and of g(1). Let f be defined by 


e If lh(o,) < 2, let f(n) =0. 


) 
e If lh(o,) > 2 and dm < o,,(1)T(e, on (0), m), let f(n) = 1. 
e If lh(o,) > 2 and Vm < o,,(1)=T(e, on (0), m), let f(n) = 2. 


Then Equation 4.1 will hold for this F and f. 
This leads us to the following definition: 


Definition 4.1.6 Let F : NX — N be continuous. 
An associate for F' will be a function f : N — N such that 


i) Ifo, < a» and f(n) > 0 then f(m) = f(n). 
ii) If f(9(m)) > 0 then f(g(m)) = F(g) +1. 
iii) Vg € NNAm(f(g(m)) > 0). 


It is then clear that any continuous functional will be computable in any of its 
associates via equation 4.1, and any computable functional will have a com- 
putable associate, see Exercise 4.2 

4.1.3. Uniform continuity and the Fan Functional 


It is well known that any continuous function on a compact metric space is 
uniformly continuous. In R, a set will be compact if and only if it is closed and 
bounded. We have a similar characterization for N®: 


Definition 4.1.7 a) We will consider the following partial ordering of NX : 
fg Vn(f(r) < g(n)) 


b) For f € NN, let Cp = {93 9 < fh. 


The following is left for the reader as Exercise 4.3 
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Lemma 4.1.8 Let ACN‘. Then A will be compact if and only if A is closed 
and AC Cy for some f. 


As a result, each continuous F' will be uniformly continuous on each Cy. Let us 
see what this actually means. The formal definition is that for any « > 0 there 
is a 6 > 0 such that for all g and h in Cy, if d(g,h) < 6 then d(F(g), F(h)) <e. 
The metric on N is trivial, so let ¢ = 4 and choose 6 > 0 accordingly. Choose n 
such that 2"-! < 6. Then g(n) = h(n) > F(g) = F(h) whenever g and h are 


in Cy. These considerations give us 


Lemma 4.1.9 Let F : NN — N be continuous. Then 


Vf € NNanVg € CyVh € Cp(9(n) = h(n) > F(g) = F(h)) 


We suggest an alternative proof in Exercise 4.4. 


Lemma 4.1.9 suggests that we may consider the operator ® defined by: 


Definition 4.1.10 The Fan Functional ® is defined on all continuous F : NN 
N and defines a function ®(F) : NY > N by 


&(F)(f) = ung € CyVh € Cy(g(n) = h(n) > F(g) = F(h)). 


Theorem 4.1.11 Let ® be the fan functional. Then ®(F) is continuous when- 
ever F is continuous. 


Proof 

Let f be given, and let a be an associate for F'. For each n there will be a finite 
set of sequences o of length n that are bounded by f. Using K6nig’s lemma 
we may find an n such that a(o) > 0 for each o in this set. This n can be 
found effectively in f and a, and will be an upper bound for ®(F, f). We only 
need information from f(n) in order to verify that this n is a good one, and we 
may compute ®(£)(f) from the information at hand. The details are left as an 
exercise for the reader. 


Remark 4.1.12 In some sense originally made precise by Kleene and indepen- 
dently by Kreisel, the fan functional is continuous, see Exercise 4.5. In the next 
section we will consider a concept of computation where we may accept even 
functionals as inputs. This begs the question if the fan functional is even com- 
putable. The answer to this is not unique, since there is no canonical choice 
of a concept of computability in this case. En Exercise 4.5 we give a positive 
answer to this question for one possible notion of computability. 

We characterized the continuous functionals as those having associates. In 
a way this is an old fashioned approach. In the current literature one often use 
domains as an entry to the theory of continuous functionals. We did not do so, 
because then we would have to introduce a lot of simple, but new, concepts. A 
reader interested in learning more about this kind of generalized computability 
is advised to consult some introduction to domain theory. 
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4.2 Computing relative to a functional of 
type 2 


A major step in the process of generalizing computability came when Kleene 
and others started to relativize computations to functionals of type 2 and higher 
types in general. For certain indices e, we have defined the partial function 
be(1,---;%n, fi,---; fm), and of course there is no harm in accepting function- 
als F,,...,F, as dummy arguments. The problem is how we, in any sensible 
way, can use F actively in a computation. 

The strategy will be that whenever we can supply F; with an argument f, 
then we get the value F;(f) from some oracle call scheme. 


Definition 4.2.1 Let UV: N?+1 x (NX)™ x (NN = N)*  N be given. 
Let @EN", fe (NY)™ and let F € (NN GN). 
By 

Ax.W (a, G, f. F) 


we mean the function g : N — N defined by 
g(x) = U(a, a, f, F). 


This makes sense even when 7 is a partial functional, but then Ax. V (2, 4, f, F) 
may not be total. 


Kleene suggested something equivalent to 


= 


Definition 4.2.2 We extend the definition of ¢,(@, f) to a definition of 
be (G, , F) by adding the clauses: 


x) Ife = (8,d,j) and 
is total, then 


xi) If e = (9) then Der aa 
be(d, a, f, F) = dalG, f, F). 


This is an example of what is generally called an inductive definition. A large 
part of generalized computability theory is about inductive definitions and the 
computational principles that are behind them. In this case, we have to assume 
that each element in an infinite set of subcomputations will terminate before we 
accept some computations to terminate. In order to handle this properly, we will 
have to introduce a generalized concept of computation tree, now a computation 
tree may be an infinite, well founded tree with countable branching at certain 
nodes. 

It will lead us too far in this introductory course to define all the concepts 
needed for a mathematically stringent handling of computability relative to 
functionals. As an appetizer, let us mention two kinds of problems: 
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1. What are the computable functionals of type 3? 
2. What are the functions computable in a fixed functional? 


Question 1 is actually hard to answer, there is no obvious characterization of 
this set. We know that the fan functional, which is defined only for continuous 
functionals of type 2, is not computable in the sense of Kleene. There are a few 
other positive and negative results about which functionals that are Kleene- 
computable, but nothing that is both general and informative. We know more 
about the answer to the second question, we will address this partly in the next 
section and partly as a small project discussed in Chapter 5. However, there 
is no complete and satisfactory characterization of the class of sets of functions 
that may turn out as the set of functions computable in some functional. 


Remark 4.2.3 Scheme ix) may seem a bit dubious, and was actually consid- 
ered to be a cheat. When we work with Turing machines or with Kleene’s 
definition of computations relative to functions, the existence of a universal al- 
gorithm is an important theorem. However, without scheme ix) we will not 
be able to prove the existence of a universal algorithm for computations rela- 
tive to functionals. We will follow Kleene, be pragmatic about it, and claim 
that including scheme ix) will give us a much more fruitful concept. However, 
introducing scheme ix) makes the two schemes v) for primitive recursion and 
vi) for the p-operator redundant. This, and other facts about computability in 
functionals, are discussed in the nontrivial Exercise 4.9. 


Definition 4.2.4 Let F be a functional of type 2, f a function of type 1. We 
say that f is Kleene-computable in F, f <K F, if for some index e we have that 


f(x) = be(a, F) 


for allz EN. = 
This definition is extended in the canonical way to the concept f <x f,F. 
We let the 1-section of F, 1— sc(F) be the set 


1— se(F) ={f 3 f <x Fh. 


The next sequence of lemmas should be considered as a small project on which 
the reader might write an essay: 


Lemma 4.2.5 If f <K fF and each fi; in f is computable in g, F,, then 
Fan G,F: 


Proof 
We may find a primitive recursive function p such that if 


> 


f(x) = ¢e(c, f, F) 


for all x and if 


for all y and i, then P 
f (2) = Pp(eseryeesem) Gs F) 


for all x. 


Lemma 4.2.6 Let F andG be functionals of type 2. Assume that F(f) = G(f) 
whenever f € 1—sc(F). Then 1— sc(F) =1-— sc(G). 


Proof 
By induction on the ordinal rank of the computation tree we show that if 
¢e(a,G) = a then ¢.(@, F) = a with the same computation tree. 

One of Kleene’s motivations for studying computations relative to function- 
als was to have a tool for investigating the computational power of quantifiers. 
Quantification over the natural numbers is captured by the discontinuous func- 
tional ?E defined by: 


Definition 4.2.7 Let 
1. 2E(f) =1 if da € N(f(a) > 0). 
2. 2E(f) =0 if Va € N(f(a) = 0). 


Definition 4.2.8 Let F and G be functionals of type 2. 


a) F <x G if there is an index e such that F(f) = ¢-(G, f) for all functions 
f:NON. 


b) F is normal if ?E <x F. 


The choice of the term ‘normal’ for these functionals reflects the focus of the 
early workers of higher type computability. We will investigate computability 
in ?E more closely in section 4.4. We end this section by showing that the 
1-section of a normal functional will be closed under jumps, see the proof of 
Lemma 3.3.26 for the definition. 


Lemma 4.2.9 Let F be a normal functional. Then 1 — sc(F’) is closed under 
jumps. 


Proof 
Generalizing Kleene’s T-predicate, we see that if g = f’, there is a computable 
predicate T’ such that 


g(a) =b — AnT(a,b,n, f). 


Given f and a we can use 7F to decide if there are b and n such that T(a, b, n, f) 
holds. 

If it does, we may search for the relevant b and output b+ 1. If it does not, we 
output 0. 
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4.3 7?E versus continuity 
Let f; be defined as 
fi(i) = 1. 
fi(d) = Oif 4 #5. 
Then lim;_.. f; is the constant zero function f while 7E(f) ¢ lim; 7E(fi). 


This shows that ?F is not continuous. Another way to see this is to observe 
that (?£)~+({0}) is not an open set, while {0} is open in N. 


We will now restrict ourselves to considering computations of $,(@, F’), i.e. com- 
putations in one functional argument and some number arguments. We do this 
in order to save notation, there is no theoretical reason for this restriction. 


We will define the n’th approximation $”(@, F’) to a computation of ¢.(G, F). 
There will be four properties to observe 


e $2 (F,@) will always be defined. 


e If ¢.(@,F)| and ¢.(4, F) = limn—o $2 (@, F’), then we can tell, in a uni- 
form way, from which no the limit is reached. 


e If $.(@, F)| and (a, F) 4 lim, .~ ¢%(a, F) we can compute 7E from F 
in a uniform way. 


e If ¢.(@, F’)| we can computably distinguish between the two cases. 


We will give most of the details, but the reader should be warned: The 
rest of this section is an example of how advanced arguments in computability 
theory might look like. (They often look more advanced than what they really 
are.) 


Definition 4.3.1 We define ¢2(a, F) following the cases for the definition of 
oo F’) ignoring schemes v), vi) and vii). 


= (1), we let $2(d, F) = ¢<(4, F). 
= (2,1), we let $2(@, F) = ¢e(d, F). 
If e = (3,q), we let 6"(a, F) = ¢-(a, F). 
If e = (4,e',dy,...,dm), let 
$°(a, F) =0. 
ge (Fa) = 6263, (4, F),--., 07, (@ F)). 
viii) If e = (8,d,1), then 
ge(a, F) =0. 
orth (@, F) = F(\r.62(a, a, F)). 
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ix) If e = (9), let 
o2(d, a, F) =0. 
GOT (d, a, F) = 05 (4, F). 
2 (a, F) = 0 in all other cases. 


Theorem 4.3.2 There are three partial computable functions 7, n and v de- 
fined on the natural numbers such that Op(c)(@,F'), bye) (GF) and $,,¢) (4, F) 
terminate whenever $-(a, F) terminates, and then 


© dn(e) (a, F) = 0 $(4, F) = limp. $2 (4, F). 


e If bc(G, F) = limp oo 2 (G, F) then $2 (a, F) = $-(G, F) whenever 
n= Pn(e) (4, F). 


LF) Alimyn 00 6" (4, F) then ?E is computable in F via the index 
F). 


) 


? 


Proof 

We will construct 7, 7 and v using the recursion theorem, so we will take the 
liberty to assume that we know the indices for these functions while defining 
them. 

We will not hide the intuition behind a technically accurate construction of 
these functions, but to some extent describe in words what to do and why it 
works. When we explain this, we will assume as an induction hypothesis that 
our construction works for computations of lower complexity. 

We will tell what to do when e corresponds to one of the schemes we have con- 
sidered. The ‘otherwise’-case is trivial, since we do not have to prove anything 
in this case, ¢. does not terminate on any input. 


If e corresponds to scheme i), ii) or iii), let m(e) be the index for the constant 
zero, n(e) the same, and we may without loss of consequences let v(e) also be 
the same. 


Let e = (4,e’,di,...,dm). 
Let 7(e) be the index for the following enumerated algorithm in (a, F’): 


1. If One) (G, F) = bn(d,) (4, F) Sees Pm (dm) (4; F) = 0, let bre) (4, F) =0 
and go to 2., otherwise let ¢,(-)(@, F) = 1 and go to 3. 


2. Let Pn(e) (4, Ee) =1+ maxt bn(er) (4, F), Pn(a) (4; Fy; paneg bn(dm) (G F)}. 


3. Select the least index d € {e’,d1,...,dm} such that $,(a)(@, F) 4 0. Then 
use the index ¢,q)(F,@) for ?E to decide if ¢.(@, F) = limp. 62 (4, F) 
or not. If it is, move to 4., otherwise to 5. 


4. We let @r(c)(@,F) = 0. We use 2E to compute the proper value of 
P(e) (G4, F). 


5. We let Pr(e) (4, F) => 1, and we let Pv(e) => Pv(d): 
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The case ix) is handled in the same way, so we restrict our attention to case 
viii): 
be(4, F) = F(Ax.¢a(2, G, F)). 

Let f(x) = ¢e(w, a, F) and let fu(e) = 62 (0,4, F) 
For each m € N, define g(x) by the following algorithm: 

1. Ask if f(a) = limp—oo fn(x). If yes, continue with 2., otherwise continue 

with 3. 
2. If F(f) = F(fn) for all n such that m < n < dbya)(a, F, a), let gm(x) = 


es («,2,F) (2). 
Otherwise choose the least n > m such that F(f) 4 F(fn) and let gn (x) = 


3. By the induction hypothesis, $,(a)(z,@, F) provides us with an index to 
compute 7E from F’. Use ?E to ask if 

n2=mF(f) 4 F(fn)): 

If not, let Gm(x) = f(x), otherwise let gm(x) = fn(a) for the least such n. 
In both cases, we will let gm(x) = f(x) if F(f) = F(fn) for all n > m, while 
9m(x) = fn(x) for the least counterexample otherwise. In the case 3. this is 
done explicitly. In case 2. we must use that if n > ¢,(d)(@, F), then f,(x) = 
Tb, Rice Fa)(@), a fact that follows from the induction hypothesis. 
Thus for each m we may computably decide if dn > m(F'(f) 4 F(fn)) by 
dn = M(EF(f) 4 F(fn)) & FF) = F(9m)). (4.2) 


From 4.2 we can decide if 


n(F(f) # FUfn))- 


If not, we let Pr(e) (4, F) = On(e) (G, F) =0. 

If there exists one such n we want to decide in a computational way if there are 
infinitely many of them or not. We may use the same kind of construction as 
that of gm(x) to compute a function h(x) such that h = f if F(f) A F(f,) for 
infinitely many n, while h = f,, for the largest n such that F'(f) 4 F(f,,) other- 
wise. We must split the construction into the same cases as for the construction 
of Gm(x), and we must rely on 4.2 when ?£ is not available. Then we get 


An > n(F(f) = F(fm)) & FUP) # F(h)). (4.3) 


In case both sides of equivalence 4.3 are positive, we can use equivalence 4.2 
and the j-operator to find ¢,,-)(@, F). It remains to show how to compute ?E 
from F in case both sides of equivalence 4.3 are negative. Actually, we will use 
the same trick one third time. Assume that F'(f) 4 F (fn) for infinitely many 
n. Let g be given. We want to decide 


dx(g(a) # 0). 


We construct f’(x) as follows: 
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e If f(x) A lim, fn(x) use 7E to decide if drz(g(x) # 0). If this is the 
case, select the least such 2, select the least n > x such that F(f) 4 F'(fn) 
and let f’(x) = fn(x). If it is not the case, let f’(x) = f(z). 


e Otherwise, ask if there is an x < ¢,a)(a, 4, F) such that g(x”) A 0. In 
both cases, do as above. 


Then 


x(g(x) #0)  F(f) 4 F(f')) 


and we are through. 
This ends our proof of the theorem. 


Remark 4.3.3 Our use of the recursion theorem is sound, but requires some 
afterthought by the inexperienced reader. What is required in order to make the 
definition of 7, 7 and v sound is to view them as Turing-computable functions 
operating on indices for computable functionals of higher types. We have given 
explicit constructions of @,(e) etc. with self reference, and analyzing exactly 
which combinations of the schemes that are required, we can express 7(e) etc. 
as functions of the Turing indices for 7, 7 and v. Then, by the recursion theorem, 
a solution exists. 


This theorem has an interesting corollary. A dichotomy theorem is a theorem 
stating that a general situation splits into two nice, in a sense opposite, cases. 
One case should then contain more information than just the negation of the 
other. We have observed that if F is a normal functional, then 1 — sc(F) will 
be closed under jump. This is actually a characterization, but in a very strong 
sense. Clearly any 1-section will be closed under the relation ‘computable in’. 
We defined a function to be of c.e.degree if it is Turing equivalent with the 
characteristic function of a c.e. set. This concept may of course be relativized 
to any function g. Finally, we say that a set Y of functions is generated from a 
set X of functions if Y consists of all functions computable in some g1,..., 9n 
from X. We then have 


Corollary 4.3.4 Let F be a functional of type 2. Then one of two will be the 
case: 


1. 2?E <x F, i.e. F is normal. 


2. There is some f € 1— sc(F) such that 1 — sc(F) is generated by the 
elements in 1 — sc(F) of c.e.(f)-degree. 


Proof 
If there is one terminating computation $.(d, F’) such that 


lim on (4, F) x be(G, F) 


n—oo 


then F' is normal by Theorem 4.3.2. 
Otherwise, let f((e,n, @)) = 02 (@, F). Clearly, f € 1—sc(F). Let g € 1—sc(F) 
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be given, g(x) = ¢-(x, F’) for all «. Then g is computable in f and the c.e.(f) 
set 


A={(x,n) ; Im > n(ge"(x, F) # b2(@, F))} 
which again is computable in F’ via Av.¢,(¢) (2). 


Kleene computability is extended to functionals of all finite types, and this 
dichotomy actually still holds. However, working with functionals in which ?E 
is not computable, one may as well work with what is known as the hereditarily 
continuous functionals. We will touch a little bit on this in section 4.5. 


4.4 The Hyperarithmetical sets 
Definition 4.4.1 A set is hyperarithmetical if it is computable in 7F. 


We have cheated a bit, and used a characterization due to Kleene as our defini- 
tion. For a more systematical introduction to hyperarithmetical sets and higher 
computability theory in general, see Sacks [3]. 

The term ‘hyperarithmetical’ indicates that this is an extension of the arith- 
metical sets in a natural way, and this is exactly what the intention is. The 
arithmetical sets will be the sets that can be defined by first order formulas in 
number theory. For each Gédel number a of a first order formula (x) with one 
free variable, we can, in a primitive recursive way, find an index 7(a) such that 


AL.Px(a) (x, °F) 
will be the characteristic function of 


{x ; ¥(x)}. 


Using the enumeration scheme ix) we can then find a subset B of N x N 
that is computable in ?E and such that each arithmetical set A is a section 
B, = {y ;(z,y) € B} of B. Then every set arithmetical in B will also be 
hyperarithmetical, and so forth. In a sense, we may say that ?E provides the 
‘arithmetical’ while the enumeration scheme provides the ‘hyper’. 


4.4.1 Trees 


If we want to analyze computations relative to functionals more closely, we need 
a precise notion of a computation tree. Although there is a common intuition 
behind all our concepts of trees, various needs require various conventions. For 
the rest of this section we will use 


Definition 4.4.2 a) A tree will be a nonempty set T of sequences of natural 
numbers, identified with their sequence numbers, that is closed under 
initial segments. The elements of T will be called nodes. 


b) (), ie. the empty sequence, is the root node of the tree. 
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c) A leaf node of a tree T will be a node in T with no proper extension in T. 


d) A branch in a tree T will be a maximal, totally ordered subset of T. If a 
branch is infinite, we identify it with the corresponding function f : N— N 
and if a branch is finite, we identify it with its maximal element. 


e) A decorated tree will be a tree T together with a decoration, i.e. a map 
f:ToN. 


f) If T; is a tree for each i € I, where J CN, we let (T;)icr be the tree T 
consisting of the empty sequence together with all sequences i * 0 such 
that 7 € I ando € Tj. 


g) Ifa eN and (7;, f;) are decorated trees for each i € I, we let 
(a, (Ti, fi) ier 
be the decorated tree (T, f) defined by 


— flixo) = fi(o) when i € I ando € Tj. 


A computation tree will be a decorated tree. Since it actually is the decorations 
that will be of interest, we sometimes take the liberty to identify them with 
the corresponding nodes with decoration in the tree. This liberty will be visible 
when we discuss the root and the leaves of a computation tree. 


Definition 4.4.3 Let ¢.(?E,@) = b. We define the computation tree of the 
computation by recursion as follows 


i) e = (1): Let the root node also be the only leaf node and be decorated 
with (e, @, b). 


ii) e = (2,2). Act as in case i). 
iii) e = (8,q). Act as in case i). 


iv) e = (e’,di,...,dm): Let (1, fi),..-,(Zim, fm) be the computation trees 
of ¢a,(?E,@) = ¢ resp. and let (Tm41, fm+1) be the computation tree of 
be (7E,C1,---;Cm) = d. 
Let the computation tree of ¢.(7E,@) = b be 


ed, yt 


i=1 


vii) e = (8,d): For eachi € N, let (T;, f;) be the computation tree of da(i, Z, ?E) 
and let the computation tree of ¢.(@,?E) = b be 


((e, a, b), (Ti, fi))ien- 
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The rest of the cases are similar, and the details are left for the reader. 


Lemma 4.4.4 Whenever ¢.(@,7E) terminates, then the computation tree will 
be computable in 7E. 


Proof 

We see how the computation tree is constructed from the computation trees 
of the immediate subcomputations. This construction is clearly computable in 
?F, and using the recursion theorem for computations relative to ?E we get a 
uniform algorithm for computing the computation tree from a computation. 


Recall that a tree is well founded if there is no infinite branch in the tree. It is 
easy to see that the computation trees of terminating computations will be well 
founded. 


Definition 4.4.5 A pre-computation tree will be a decorated tree that locally 
looks like a computation tree, i.e. each node will be of the form (d, a, b), the leaf 
nodes will correspond to initial computations, and other nodes relate to their 
immediate subnodes as in Definition 4.4.3. 


Lemma 4.4.6 =a) The concept of a pre-computation tree is arithmetical. 
b) A pre-computation tree that is well founded is actually a computation tree. 


c) If ¢-(@,?E) terminates, there is exactly one pre-computation tree with a 
root node on the form (e,G,b), and then b will be the value of ¢.(a,7E). 


Proof 

a) is trivial. In order to prove b) we observe that a subtree of a well founded 
pre-computation tree is itself a well founded pre-computation tree. Since facts 
may be proved by induction on the subtree ordering of well founded trees, we 
may use this kind of induction to prove the statement. The details are trivial. 
The set. of computations relative to ?E was defined by induction, and then we 
may prove facts by induction over this construction. c) is proved this way in a 
trivial manner. 


4.4.2 II?-sets etc. 


In this section we will let ‘computable’ mean ‘“Turing-computable’. 


Definition 4.4.7. a) A product set will be any product of the sets N and NN 
in any finite number and order. We define the arithmetical hierarchy of 
subsets of product sets X as follows: 


i) Aset A is a U9-set and a I$-set if A is computable. (In the literature 
you may find that A is supposed to be definable using only bounded 
quantifiers in order to be in these classes. This distinction does not 
matter for our applications.) 
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ii) A set AC X is D2, , if for some II? subset B of N x X we have that 


EEAS Ar € N((z,Z) € B). 
iii) A set AC X is II}, , if for some X2 subset B of N x X we have that 
EEASVa € N((z,Z) € B). 


iv) Aisa AQ-set if A is both II? and x2. 


b) We define the analytical hierarchy in the same fashion, but will only need 
the first level here: 


i) Aset AC X is Tit if there is some arithmetical set B C N‘ x X such 
that 
EAS VE eN((f,Z) € B). 


ii) Aset AC X is “1 if there is some arithmetical set B C NN x X such 
that 
EASA eN((f,Z) € B). 


iii) A is A} if A is both St and It. 


Lemma 4.4.8 Let X be a product set, AC X. Then A is X9 if and only if A 
is the domain of a partial computable function of arity X. 


Proof 
For product sets where all factors are N, these are two of the characterizations 
of c.e.-sets. The proof of the equivalence generalizes trivially to all product sets. 


In section 4.1 we showed how each product class either is homeomorphic to N or 
to N‘. We used pairing-functions (—, —) for pairs of numbers, pairs of functions 
or for a pair consisting of one number and one function for this. These functions 
can be used to show that multiple quantifiers of the same kind can be reduced 
to one, and that number quantifiers can be ‘eaten up’ by function quantifiers. 
For a complete proof, we need one more homeomorphism: 


Lemma 4.4.9 (NY)N is homeomorphic to NN. 


Proof 
We use the observation (NN) ~ NNXN ~ NN since Nx N & N. Precisely, if 
{fi}ien is a sequence of functions, we code it as one function by 


(fidien((r,m)) = fr(m). 
This is a computable bijection. 


All these coding functions will have inverses for each coordinate. We use (—); 
for the inverse at coordinate i, letting pairing functions have coordinates 1 and 
2. We can then perform the following reductions of quantifiers. Note that all 
the dual reductions will also hold. 
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Lemma 4.4.10 For each relation R the following equivalences hold: 


i) InamR(n,m) = AKR((k)1, (k)2). 
ti) APAGR(F, g) > ANR((A)1, (h)2)- 


iii) AfanR(f,n) — AgR((g)1, (g)2) ( Where the decoding is different from the 
one in ii).). 


iv) VnamR(n,m) & AfVnR(n, f(n)). 
v) Vnaf R(n, f) = AgVnR(n, (g)n)- 


The proofs are trivial and are left for the reader as Exercise 4.10. 


Lemma 4.4.11 a) The classes =°, II2, St and Il} are closed under finite 
unions and finite intersections. Moreover, the “-classes are closed under 
dn € N and the I-classes are closed under Vn € N. 


b) The classes II} and Xt are closed under number quantifiers. 


c) If4-normal form theorem 
If AC X is It, there is a computable set RC NN x Nx X such that 


FEASVSANR(f,n, Z). 


In the proof we use standard prenex operations and the reductions described in 
Lemma 4.4.10 The details are left for the reader, see Exercise 4.10. 


Theorem 4.4.12 Let T be one of the classes TI2, U2 (k > 1), II} or X} (or 
TU}, =} as defined in Exercise 4.11). For each product set X there is a universal 
T-set A in N x X, i.e. such that for each T-set BC X there is ann € N such 
that for all Ze X: 

FEBS (n, ZEA. 


Proof 

Let I be the dual of I, i.e. the set of complements of sets in I. Clearly, if the 
property holds for I’, it also holds for I’. Since the II- and ¥-classes are duals 
of each other, it is sufficient to prove the lemma for one of each pair. 

The existence of universal algorithms together with Lemma 4.4.8 ensures the 
theorem to hold for ©). Then any class I of sets definable from =9-sets using a 
fixed quantifier prefix will have universal sets for each product set. Each of our 
classes T is either one of these or the dual to one of these. For II} we need the 
normal form theorem, see Lemma 4.4.11 c). 


We will end this subsection by viewing the connection between II}-sets and well 
founded trees. 
Let A CN be a I}-set written in its normal form 


meAsSVfsAnR(m,n, f). 
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Since R is computable, there is, by the finite use property, a computable relation 
Rt on N® such that 


me ASVfanakRt (m,n, f(k)). 
For each m we let T,, be the tree of finite sequences a such that 
Vn < lh(a)Wr X oAR*(m,n,7T). 
Then Tj, will be a tree, and we will have 


m€A€#T,, is well founded. 


If we put a bit more effort into the arguments, we can actually show that we 
may choose Rt to be primitive recursive, this is connected with the relativized 
Kleene’s T-predicate, and thus each I} subset of N is m-reducible to the set of 
indices for well founded primitive recursive trees. On the other hand, this set 
is easily seen to be IT} itself. Thus the class of II+-sets contains an m-maximal 
element, just like the c.e. sets. 

In the next subsection we will se more analogies between the c.e. sets and the 
IIt-sets. 


4.4.3 Semicomputability in 7E and Gandy Selection 


Definition 4.4.13 A subset A C N is semicomputable in 7E if there is an index 
e such that for all a € N: 
b-(a,"E)\Sac A, 


where | still means ‘terminates’. 
Lemma 4.4.14 All TI} subsets of N are semicomputable in ?E. 


Proof 

Clearly the set of indices for computable trees is computable in 7F, so it is 
sufficient to show that the concept of a well founded tree is semicomputable. 
This can be done with a careful use of the recursion theorem: Let e be any 
index. Using the enumeration scheme and 7E we can define a computable 
function ®(e, 7’) such that 


e O(e,T,?E) = 0 if T consists of exactly the empty sequence. 


e O(c,T,7E) = 2E(And.(T)) if T is a more complex tree, where T;, is the 
set of sequences o such that nx a € T. 


By the recursion theorem for ?E, there is an index eg such that ®(e9,T,?E) = 
be, (T,7E) for all T. 

By induction on the well founded trees it is easy to se that ¢.,(T) will terminate 
whenever T is well founded. In order to prove the other direction, we must 
inspect the proof of the recursion theorem and see that in this particular case, if 
ge, (T) terminates, then either T' consists of the empty sequence only or ¢e, (Tn) 
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must terminate for each n and be a subcomputation. This is because we only 
used e as an index in connection with the enumeration scheme in defining ®. It 
follows that T must be well founded. 


Theorem 4.4.15 Let ACN. Then the following are equivalent: 
i) A is It. 
ii) A is semicomputable in 7E. 


Proof 

Lemma 4.4.14 gives us one direction. 

The set C of computation tuples (e, @, 6) such that ¢.(?E, @) = b is defined by a 
positive induction, i.e. there is a formula ®(x, X) such that atomic subformulas 
t(a) € X will occur positively and such that C is the least set such that 


C= {a; &(a,C)}. 


Then 
cE€CeVB(BC {a; O(a, B)} > cE B). 


Remark 4.4.16 This proof is trivial provided the reader has been through a 
quick introduction to inductive definitions. 
In Exercise 5.9 we offer a self-service introduction. 


Now is the time to assume that the reader is familiar with well orderings and 
preferably with ordinal numbers. For readers unfamiliar with this, we offer 
Exercise 5.11, a guided self-service introduction to the topic. 


Definition 4.4.17 Let T be a well founded tree on N. 


a) The rank |T| of T, is the ordinal rank of T seen as a well founded relation, 
whereo <TST Xo. 


b) The rank |o|r of o € T, is the value of the rank function for T on o. 


c) If T is not well founded, we let |7'| = co considered to be larger than any 
ordinal number. 


Theorem 4.4.18 There is a two-place function ® partially computable in ?E 
such that o(T,S) terminates exactly when both S and T are trees, and at least 
one of them is well founded, and then 


1. If |T| <|S| then ®(T,S) =0 
2. If |S| <|T| then ®(T, 8) =1. 


Proof 

We will use the recursion theorem for 7£, and the argument is an elaboration 
on the argument showing that the set of well founded trees is semicomputable 
2 

in “E. 
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We define ®(S,T) by cases, using self reference, and the solution using the 
construction behind the recursion theorem will give us the result. The set of 
trees is computable in 2, so for the sake of convenience, we assume that. both 
T and S are trees. Then 


e If T consists of only the empty sequence, let ®(7,S') = 0. 


e If S consists of only the empty sequence, but T contains more, let ®(T, S) = 
1. 


e If neither T nor S contains just the empty sequence, let 


B(S,T) = 0 Vnam(G (Tp, Sm) = 0), 


where we use 7E to decide this, and let 1 be the alternative value. 


It is easy to see by induction on the rank that if one of the trees is well founded, 
then ® does what it is supposed to do. If neither T nor S are well founded, 
®(T,S) will not terminate. This is however not important, and we skip the 
argument. 


Since every terminating computation is associated with a well founded compu- 
tation tree, we may think of the rank of the computation tree as a measure of 
the length of the computation. 


Definition 4.4.19 Let ¢.(?E,@) =. The length |(e, @,b)| of the computation 
will be the ordinal rank of the corresponding computation tree. If (e, @, b) is not 
the tuple of a terminating computation, we let |(e, @, b)| = co. 


Theorem 4.4.18 has an interesting application, the Stage Comparison Theorem: 
Corollary 4.4.20 There is a function V partially computable in 2E such that 


whenever o = (e,@,b) and o’ = (e’,a’,b’) are tuples, then U(o,0') terminates if 
and only if at least one of o and o' is a genuine computation tuple, and then 


WU(o,0') =0 = |o| < Jo’. 


We have proved a selection theorem for ordinary c.e. sets, the selection was 
performed by searching for the least pair where the second element was a witness 
to the fact that the first element was in the c.e. set in question. Gandy showed 
how we may combine this idea and the ordinal length of computations to prove 
a selection theorem for computations in 7 E: 


Theorem 4.4.21 (Gandy Selection) 
Let AC NXN bec.e. in?E. Then there is a selection function for A computable 
in 7B. 


Proof 
We will use the recursion theorem for ?E. Let 


A= {(a,b); $e,(a, b,?E)L}. 
Let f be computable in 7F satisfying 
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1. f(e,a,k) =0 if |(eo, a, k)| < |(e,a,k + 1)]. 
2. fle,a,k) = fle,a,k+1) +1 if de(a,k +1,2B) | and [(e,a,k + 1)| < 
(eo, a, k)| 


By the recursion theorem there is an index e; such that 
f (er, a, k) = de, (a, k, 2B). 


We claim that \a.¢¢, (a, 0,7) is a selection function for A. Let a € N be given. 
First observe that if ¢., (a, k,2E)| then f(e, a, k) will terminate, so ¢.,(a,k,?E)|. 
Moreover, observe that if d¢,(a,k,7E)| then ¢¢,(a,k’,2E)| whenever k’ < k. 
Thus 

Akbes (a, k,*E)l=> be, (a,0,7E) . 


Now assume that ¢¢, (a,0,?E)|. If we look at the computation of ¢.,(a,0,?E) 
we see that we will have ¢.,(a,1,7F), ¢¢,(a,2,?F),.... as subcomputations as 
long as part 2 of the algorithm for f is followed. The ranks of these computations 
will be a decreasing sequence of ordinals, and this sequence must come to an 
end. It comes to an end exactly when we hit a ko such that 


(eo, a, ko)| < |(e1, a, ko + 1). 


Then (a, ko) € A and backtracking the value of ¢., (a, k’,?E) for k’ < ko we see 
that @e, (a, k’,?E) = ko — Kk’. 
Consequently Aa.¢¢, (a, 0,2) will be a selection function for A 


Remark 4.4.22 This proof is of course uniform in eg. 
Corollary 4.4.23 Let ACN. Then the following are equivalent: 
i) A is computable in 7E. 
ii) Both A andN\ A are semicomputable in 7B. 


Proof 
Clearly, if A is computable in 7, then both A and its complement will be 
semicomputable in 7. 
Now assume that both A and N \ A are semicomputable in 7. 
Let 
B={(0,n); n€ A}U{(1,m); m¢ A}. 


B is semicomputable, and B is the graph of a function. By Gandy selection this 
function must be computable in 7. The result follows. 


4.4.4 Characterising the hyperarithmetical sets 


There is almost nothing left for us to do in this subsection. We have shown that 
a set A is semicomputable in 7 if and only if it is II}. We have shown that a 
set B is computable in 7E if both B and its complement are semicomputable 
in ?E. This gives us 
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Corollary 4.4.24 A set is hyperarithmetical if and only if it is At. 


Remark 4.4.25 Our proof of Corollary 4.4.24 is flavored by computability the- 
ory, but there are alternative proofs in the literature. The result, in a different 
form, goes back to Suslin in 1917. He essentially showed that a set A C NN is 
Borel if and only if both A and its complement are projections of closed sets in 
(N¥)?, i.e. that relativized At is the same as Borel. 


There are numerous analogies between the pair (Computable, computably enu- 
merable) and the pair (At, I+). Some of these are left for the reader as Exercise 
4.12. 

There is a close connection between hyperarithmetical theory and fragments of 
set theory. For readers familiar with axiomatic set theory, we offer a self-service 
introduction in Exercise 5.12. 


4.5 Typed A-calculus and PCF 


Kleene’s definition of computations relative to functionals can be extended to 
functionals of even higher types than two. This will, however, be a too spe- 
cialized topic to be introduced in this text. Kleene’s concept turned out to be 
useful in definability theory, the theory of what may be defined with the help 
of certain principles, but we have moved quite a bit away from what what we 
might call “genuine computability” . 

The concept of higher type computability is nevertheless of interest also when 
genuine computability is the issue, e.g. in theoretical computer science. In this 
section we will give a brief introduction to PCF. PCF is a formal programming 
language for computing with typed objects. It has its roots in work by Platek, 
Scott developed the first version of it as a formal logic for computability and 
Plotkin gave it the form we will be investigating. The intuition should be that 
we are operating with hereditarily partial, monotone and continuous functionals. 
We will explain this better when needed. 


4.5.1 Syntax of PCF 


Definition 4.5.1 We define the formal types, or just types as a set of terms for 
types as follows: 


1. v and o are types. These are called the formal base types. 
2. If o and 7 are types, then (0 — T) is a type. 


Remark 4.5.2 When we give a semantical interpretation of PCF’, we will as- 
sociate a mathematical object to each formal type. We think of v as denoting 
the set of natural numbers, o as denoting the set of boolean values and (a — rT) 
as denoting the relevant set of functions mapping objects of type o to objects 
of type rT. Since we will use these objects to interpret algorithms, we have to 
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interpret nonterminating algorithms as well. We will do so by including an el- 
ement for ‘the undefined’ in the base types, and carry this with us for higher 


types. 


As usual the term language will consist of variables, constants and combined 
terms. What is new is that each term will be typed, that some constants are 
treated as function symbols, and that we need type matching when forming 
combined terms. 


Definition 4.5.3 The terms in PCF are inductively defined as follows: 


1. Variables: 
For each type o there is an infinite list x7 of variables of type a. 


2. Constants: 
In PCF we have the following typed constants: 


ky of type e for each natural number n. 

tt and ff of type o. 

Z of type lo. 

S and P of typet— et. 

D, and D, of types o > (t > (4 — t)) and o > (0 = (0 = 0)) resp. 


Y, of type (o — 0) — a for each type o. 
3. Combined terms: 


If M is a term of type o > 7 and N is a term of type a, then (MN) 
is a term of type 7. This construction is called application. 


If M is a term of type 7 and 2 is a variable of type o, then (Av.M) 
is a term of type 0 > T. 
This construction is called abstraction. 


The intuition behind these constants and terms are as follows: 

0 will denote the zero-element in N, tt and ff the two truth values. 

Z will test if a number is zero or not. 

S and P are the successor and predecessor operators on the natural numbers. 
> will select the second or third argument depending on the boolean value of 
the first argument. Thus there will be one for each base type. 

Y, will denote the fixed point operator. The idea is that whenever f : a0 — a, 
then f will have a least fixed point a of type 7. Our challenge will be to formalize 
this in the formal theory and to find mathematical interpretations of these types 
such that this makes sense. 

Application (MN) simply mean that M is thought of as a function, N as an 
argument and (MN) then just denotes the application. It may be confusing 
that we are not using the more conventual notation M(N), but this has turned 
out to be less convenient in this context. 
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If M isa term of type 7, M may contain a variable x of a type o. The intuition is 
that M denotes an unspecified object of type 7, an object that becomes specified 
when we specify the value of x. (Av.M) will denote the function that maps the 
specification of x to the corresponding specification of M. 


The reader may have noticed that we have omitted a few parentheses here and 
there. We will do so systematically, both for types and terms. 

If f is a function of type (a — (7 — 4)) we will view f as a function of two 
variables of types 0 and 7. We will write 0,7 — 6 for such types. If then M is 
of type o0,t — 6, N is of type o and K is of type r we will write MNK instead 
of (((M7)(NV))(c)).This means that the application is taken from left to right. 


Lemma 4.5.4 Each type will be of the form o = 71,...,T™™ — 6 where n > 0 
and b is one of the base types. 


The proof is easy by induction on the length of a seen as a word in an alphabet. 


With this convention we see that the type of D, is 0,1, — v and the type of D, 
is 0,0,0 — o. We then view these as functions of three variables. 


4.5.2 Operational semantics for PC'F 


An operational semantics for a language designed to describe algorithms will be 
the specification of how to carry out step-by-step calculations or computations. 
The operational semantics for PCF will be a set of rules for how to rewrite 
terms in order to ‘ compute’ the value. This is of course most relevant when we 
are dealing with combined terms of base types. 


In PCF we have a similar distinction between free and bounded occurrences of 
variables as in first order logic, x becomes bounded in Ax.M. As for first order 
languages, one term N may be substituted for a variable x in a term M if no 
variables free in N becomes bounded after the substitution. We write Mx, for 
the result of the substitution, always assuming that N is substitutable for x in 
M. 


Definition 4.5.5 We define the relation —-, denoted by a long arrow, as the 
reflexive and transitive closure of the following one-step reductions (we assume 
that the typing is correct): 


Zkp — tt. 

Zina — ff. 
Pkysi — kn. 
Skn — kn41- 


Dp tt N — M where b is a base type. 
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Dp ff MN — N where b is a base type. 
(Az.M)N — Mx. 

Y,M — M(Y,M). 

M — M' > MN — M'N. 

N — N'> MN — MN’. 
ArM — ryM;. 


The last item tells us that we may replace a bounded quantifier with another 
one not occurring in M. As an example we will see how we may compute f(x) 
= 2x on the natural numbers: 


Example 4.5.6 The function f(2) = 2 is defined by primitive recursion from 
the successor operator as follows: 


© #(0) =0 
© f(S(x)) = S(S(f(a)). 
For the sake of readability, let g be a variable of type « — +e. Consider the term 
M of type s > u defined by 
M=Y,..,Ag.Au'(D, (Zr) ko S(S(g(P2)))). 


If we look at the expression Ag.Ax'(D (Zx)koS(S(g(Pz)))) we see that to each 
g we define a function that takes the value 0 on input 0 and g(a — 1) + 2 for 
positive inputs x. The least fixed point of this operator is exactly the function 
f. The verification of e.g. Mkz —> kg is a lengthy process. 


Example 4.5.7 We will show how the p-operator can be handled by PCF, 
i.e. we will define a term of type (4 — +) — v that must be interpreted as the 
[-operator. 

Note that F'(f) = ux.f(x) = 0 can in the same sense be defined in the following 
way: F(f) = G(f,0) where 


G(f,k) =0 if f(k) =0. 
G(f,k) =G(f,k+1) +1 if f(k) > 0. 


G will be the fixed point of a PC F-definable operator, and then the juz-operator 
is definable. 


The conditionals >, and D, can be extended to conditionals D, for all types a, 
see Exercise 4.13 


Definition 4.5.8 Let f : N—N bea partial function. We say that f is PCF- 
definable if there is a term M of type v — « such that the following are equivalent 
for all numbers n and m: 
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1. Mkyn — km 
2. f(k) is defined and f(n) =m. 


We used the expression ‘PC'F-definable’ also for the y-operator, trusting that 
the reader accepts this yet not clarified terminology. We then have 


Theorem 4.5.9 The PCF -definable functions are exactly the partial computable 
ones. 


Proof 

We have shown how to handle the p-operator and indicated how to handle 
primitive recursion. The full details are then easy and are left for the reader as 
Exercise 4.14. 


4.5.3 A denotational semantics for PCF 


The operational semantics for PCF explains how we may rewrite a term in 
such a way that we eventually may read off a number or a boolean value as 
the interpretation of the term. When we give a denotational semantics, we 
will interpret each term as a mathematical object in some set. One important 
aspect will be that the reductions of the operational semantics should not alter 
the denotational interpretations. If we look at the term M for the f(x) = 22 
function, we obtain that Mk2 —> k, in the operational semantics, while we 
want [Mk2] = 4, where we will use [-] for the denotational semantics. 


For the rest of this section, we leave out essentially all details. It is not expected 
that the reader will be able to fill in the details without consulting a textbook 
on domain theory or some other introduction to a similar topic. Most lemmas 
etc. will be left without proof. 

Consider the term N = Y,Aa.S(x). If we try to evaluate Nko we see that we 
get an infinite reduction sequence. Thus the only sensible way to interpret this 
term is by accepting ‘undefined’ as a possible value, and then using it in this 
case. 


Definition 4.5.10 Let N,; and B, be the set N of natural numbers and the 
set B of boolean values {true, false} extended with a new element 1 for the 
undefined. 


We will interpret each type o as a set D(c), and we let D(t) = N, and D(o) = 
By. 

Each of these sets will be partial orderings by letting | be the smallest element 
and the rest maximal elements that have no ordering between them. Such 
ordered sets are called flat domains. 


Definition 4.5.11 A partial ordering (D,C) is called bounded complete if 
1. Each bounded set has a least upper bound. 


2. Each directed subset X C D is bounded, with least upper bound LLX. 
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It is easy to see that N, and B, both are bounded complete. 


Definition 4.5.12 Let (D,Cp) and (£,Cg) be two partial orderings that are 
bounded complete. 

A function f : D > E is called continuous if f is monotone and for all directed 
subsets X of D we have 


UpxX = Un{f (x) ; rE X}. 


Lemma 4.5.13 Let (D,C) be bounded complete and let f : D — D be contin- 
uous. Then f has a least fixed point in D. 


Proof 
The empty set has a least upper bound, which we call | p. 
Then 


Lp En sep) Gp SLp)) Ee 
The least upper bound of this sequence will be the least fixed point of f. 


Definition 4.5.14 Let (D,Cp) and (E, C2) be two orderings that are bounded 
complete. 

Let D => E be the set of continuous maps from D to E. 

If fe D—Eandge D— E, we let f Cpe g if 


Va € D(f(x) E g(2)). 


Lemma 4.5.15 Let (D — E,Cp_.f) be as above. This partial ordering will be 
bounded complete. 


Not mentioning the ordering, we now interpret each type o by recursion on o 
as follows: 
D(a > T) = D(o) > D(r). 


What remains to be done can briefly be described as follows: 


1. An assignment will be a map from a set of typed variables x? to elements 
of D(a). The set of assignments can be viewed as elements of cartesian 
products of some D(c)’s, and will be ordered in a bounded complete way 
by the coordinate-wise ordering. 


2. Each term M of type o will be interpreted as a continuous function [M] 
from the set of assignments to D(a). It will sometimes be convenient 
to consider assignments restricted to the free variables of M, sometimes 
convenient to accept dummy variables. 


3. We must show that the least fixed point operator is a continuous map 
from (D,C D) to D. This is used to define [Y,]. 


4. We must show that application is a continuous map from (D > E) x D 
to E. This is used to define [MN] from [M] and [N]. 
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5. We must show that abstraction is a continuous map from D x E > F to 
D— (E = F). This is used to define [Av.M] from [M]. 


6. We must give interpretations to the constants k,, S, P, Z, D, and Do. 
This can easily be done by the reader. 


7. We must show that if M —> N then [M] = [AN]. The only case that 
requires some work is the reduction 


(At.M)N — Mx 
where we must show by induction on M that 


[M1 (sfivq(s)) = Mn I (s).- 
The proof by induction is not very hard. 


The denotational semantics for PCF that we have given is the one used 
originally by Scott when he formed the logic LCF that was turned into the 
programming language PCF’ by Plotkin . There are however alternative ways 
of interpreting PC’F-terms reflecting interesting aspects of PCF’. One conse- 
quence of the existence of a denotational semantics is that a closed term of type 
. cannot be interpreted as two different numbers. Consequently, though the use 
of —> is a non-deterministic process, there is no canonical way of performing 
the next step, we do not risk to obtain different values to the same term. 

We end this section by stating the converse, due to Plotkin, without any hints 
of proof: 


Theorem 4.5.16 Let M be a closed term og type v. If [M] =n, then M — 
kn. 


4.6 Exercises to Chapter 4 


Exercise 4.1 A continued fraction is a finite or infinite tree of fractions 


1 


1 
l+nit+tpasEe 


l+no-4 


A finite continued fraction will be a rational number while the value of an 
infinite continued fraction is defined as the limit of the finite subfractions. This 
limit will always exist. Why? (This part is not an exercise in logic, rather in 
elementary analysis.) 


a) Show that if 0 < a@ < 1, then there is a unique continued fraction with 
value a. 


b) Show that the continued fraction of a will be infinite if and only if a is 
rational. 
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c) Show that the bijection obtained between the irrational elements in [0, 1] 
and NN using continued fractions is a homeomorphism. 


d) Discuss in which sense we may claim that the homeomorphism in c) is 
computable. 


Exercise 4.2 Show that if F is a computable functional of type 2, then F' will 
have a computable associate. 


Exercise 4.3 Prove lemma 4.1.8. 


Exercise 4.4 Let F : NN — N be continuous, and let T(F, f) be the set of 
finite sequences o bounded by f such that F is not constant on B,. 

Show that T(F, f) will be a tree of sequences, and use Konig’s lemma to show 
that T(F, f) is finite for each f € N%. Use this to give an alternative proof of 
Lemma 4.1.9. 


Exercise 4.5 Show that there is a continuous function @ : NY — NN such 
that whenever a is an associate for a continuous F : NN > N, then ®(a) is an 
associate for ®(F). 

Show that ® can be chosen to be computable in the sense that 


G(x, a) = (a)(z2) 
is computable. 


Exercise 4.6 Tait showed that the fan functional is not Kleene-computable. 
Fill out the details in the following proof of Tait’s result: 


1. A quasi-associate for F is a function a : N — N such that 


e a(n) =m+1=> F(f) =m when o,, ~ f. 
e a(n) > OA Gy < Om => a(m) = a(n). 
e If f € 1—sc(F) then for some n, o, < f and a(n) > 0. 
If ¢-(@, F)| and a is a quasi-associate for F’, then there is some n such 


that whenever G has an associate extending (a(0),...,a(m)) and ¢.(4, G)| 
then 


be(G, G) = be (4, F). 
Hint: Use Use induction on the ordinal rank of the computation tree for 


be(G, Ee 


2. There is a quasi-associate for the constant zero function 70 and a non- 


computable f : N — {0,1} such that a(f(n)) = 0 for all n. 


3. For any finite part a(n) of a there is an associate @ extending a(n) for a 
functional that is not constant on {0,1}. 
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4. Combining 2. and 3. we obtain a contradiction from the assumption that 
the fan functional is computable. 


Exercise 4.7 If F: NN +N, let Fi(f) = F((i, f)). 
Show that there is a total functional [ satisfying the equation 


(PF) = Fo(Ar.T(Fi41)), 


and that we may compute ['(F’) uniformly from an associate for F’. 
The historical interest is that [ is not computable in the fan functional. This 
is too hard to be considered as an exercise. 


Exercise 4.8 Work out detailed proofs of Lemmas 4.2.5, 4.2.6 and 4.2.9. 
This cannot be seen as a simple exercise, but rather as a minor project. 


Exercise 4.9 We reduce the definition of ¢.(4, f.F) by ignoring schemes v) 
and vi), and technically work with a subsystem. 


a) Prove the $7"-theorem for computations with function and functional ar- 
guments as well as number arguments. 


b) Prove the recursion theorem for computations with function and functional 
arguments as well as number arguments. 


c) Use the recursion theorem to show that scheme v) is redundant in the 
original definition, i.e. that the class of functional computable in the 
subsystem is closed under primitive recursion. 


d) Use the recursion theorem to show that scheme vi) is redundant in the 
original definition, i.e. that the p-operator is definable in the subsystem. 
Hint: You may look at how the p-operator is defined in PCF. 


Exercise 4.10 Prove Lemma 4.4.10 and Lemma 4.4.11 in detail. 


Exercise 4.11 We actually have a hierarchy for sets definable by second order 
formulas as well. Let 


e ACX is II},, if for some Uj set BC NY x X, 
ZEASVE CN((f,Z) € B). 
e ACX is Ui, , if for some Ij, set BC NY x X, 


FEASAfeNN((f,z) € B). 


a) Prove that the class of II} sets are closed under finite unions and intersec- 
tions, number quantifiers and universal function quantifiers. Prove that 
there are universal I} sets for any k > 1 and any dimension. 
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b) Formulate and prove the analogue results for the X}-classes. 


Exercise 4.12 a) Let f : N — N be computable in 7F. Show that the 
image of f will also be computable in ?£. Discuss what this means for 
the relation between semicomputable sets and c.e. sets relative to 7E. 


b) Show that the class of sets semicomputable in 7 is closed under finite 
unions and intersections, but not under complements. 


c) Show that two disjoint D}-sets can be separated by a Aj-set. 
Hint: Use Gandy selection. 


d) Show that there are disjoint II}-sets that cannot be separated by any Aj- 


sets. 
Hint: Use an analogue of the construction of two computably inseparable 
c.e. sets. 

Exercise 4.13 Let ¢ = 7,...,7 — b be a type, where b is a base type. 


Show that there is a term D, of type 
0,0,0 7-0 
such that whenever s and t are terms of type a, then, 
Dg (tt)st —> Nay +++ ATT say +++ By. 
Do (ff)st — Aaj) +--+ Aut tay: ++ Tn. 


Exercise 4.14 Prove Theorem 4.5.9 in detail. 
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Chapter 5 


Non-trivial exercises and 
minor projects 


In this chapter we have collected some exercises or minor projects that are 
based either on the material in more than one chapter or that are too hard to 
be considered as exercises in the ordinary sense. We call them exercises, though 
some of the items in this chapter are much more demanding than ordinary 
exercises. 


Exercise 5.1 Let L be a finite language. Show that the relation ‘¢ is true in 
all finite L-structures’ is decidable if all predicates are unary, while it in general 
is complete co-c.e. 


Exercise 5.2 Show that there is a computable partial ordering (N, <) such 
that every countable partial ordering (X’, <’) can be embedded into (N, ~). 


Exercise 5.3 In this exercise we will explore Skolem’s construction of a non- 
standard model for number theory. We adjust the proof to the setting that we 
have been using in this compendium. 

Let D* be the set of subsets of N* definable in the language of Theorem 1.2.2. 
We let D = D!. 

If f: NN, we let f €¢ Dfv" if the graph of f is in D?. 


a) Show that D is countable. 


b) Show that there is a maximal, nonprincipal filter F on D, i.e. aset F C D 
such that 


- Nef. 

- F has no finite elements. 

-If DE Fand DCEED, then EEF. 
-If DE Fand EEF, then DN EEF. 
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- If DED, then DEF orN\ DEF. 
c) For f,g € Diu’, let 
f=Frg 
if 
{n; f(n) = g(n)} €F. 
Show that =¢ is an equivalence relation. 


d) Let \’* ve the set of =¢-equivalence classes [f], with the interpretations 
of the non-logical symbols in L as follows: 


0* and 1* are the equivalence classes of the respective constant func- 
tions. 


Addition, multiplication and exponentiation of equivalence classes 
are defined via the corresponding pointwise operations on represen- 
tatives. 


[f] <* [g] @ {n; f(n) < g(n)} € F. 


Work out the details and show that these concepts are well defined, i.e. 
independent of the choice of representatives. 


e) Let o(a1,...,@%) be a formula in L. 
Let fi,.--, fe € DFu”. 
Show that 


N* & O([fi,---> [fe]) {rn 3 ME o(fi(n),.-., fe(n))} € F. 
Hint: Use induction on @. 


f) Show that NV and AN’* are not isomorphic, and describe an elementary 
embedding of NV into N*. 


Exercise 5.4 For the sake of notational simplicity we extend the second order 
languages with variables for functions as well. 


a) Find a second order formula that expresses that if X is a set, f: X — X 
and « € X then Y = {f"(x) ; n € N}. 


b) Discuss to what extent our formal definition of a finite set is in accordance 
with your intuition about set theory. 


c) It is a fact of set theory that every set accepts a total ordering. Use 
this and our formal definition of infinity to show that the following are 
equivalent. 


1. X is finite. 


2. Every total ordering of X is a well ordering. 
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Exercise 5.5 Let La be the language of equality. We say that a subset A of 
N is definable in second order finitary logic if there is a sentence ¢ in L2 such 
that n € A if and only if {0,...,.n—1} 4. 

Discuss the properties of the class of sets definable in second order finitary logic 
in terms of decidability, complexity and closure properties. 


Exercise 5.6 Let & be a fixed alphabet. Show that there is an enumeration 
{Mi}ien of the set of Turing machines over © and a one-to-one enumeration 
{w;}jen of the set of words in &* such that the function f satisfying the equation 


fli,j) =k & Mi(w;) = wp 


is computable, where ~ means that either is both sides undefined, or both sides 
are defined and with the same value. 


Exercise 5.7 In the text we have indicated how to establish some of the prop- 
erties of the hierarchy {Fy}a<e,- Work out all the details and write a small 
essay about it. You may well combine this with Exercise 5.8. 


Exercise 5.8 Show that if a < €9 then Fy is provably computable. Suggest a 
fundamental sequence for €9 and thus a function F.,. If properly defined, Fi, 
will not be provably computable in PA, but in some second order extension. 
Discuss how we may formulate a proof of the totality of F., in some second 
order extension of PA. 


Exercise 5.9 Let L be a first order language, and let X be a new predicate 
symbol of arity 1. Let L’ be L extended with X. Let ¢(#, X) be a formula in 
L’ with at most x free. 

The predicate X (which we may think of as a set variable) is 


positive in @ if X does not occur in ¢, if ¢ is of the form X(t), if ¢ is of 
one of the forms v1 V w#2 or Sy where X is positive in WY, and we or if d 
is of the form —w where X is negative in w. 


negative in @ if X does not occur in @, if ¢ is of the form —X(t), if ¢ is of 
one of the forms w1 V w2 or Syv1 where X is negative in v1 and wz or if 
¢ is of the form 7w where X is positive in w. 


Now, let X be positive in ¢(#,X). Let 2% be an L-structure, B C A. Let 
T'g(B) = {ae A; (a, B)}. 


a) Show that Iy is a monotone operator on the set of subsets of A, i.e. that 
if B CC then T¢(B) Cc Tg(C). 


b) Show that Ty will have a least fixed point, i.e. there is a least set B such 
that [y(B) = B. 


c) Now let L be the language of number theory and 2 the standard model 
MN of number theory (i-e. the natural numbers as a structure). Let X be 
positive in (a, X). Show that the least fixed point of Tg is a II}-set. 
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d) Show that there is an m-maximal II}-set that can be defined by positive 
induction. 


e) Hard! Show that there is a II}-set that is not the least fixed point of any 
Tg. 


f) Optional! Show that if X is positive in ¢, then I'y will have a maximal 
fixed point. 
Show that if ¢ is a positive IT?-formula, i.e. we do positive induction over 
N, then the maximal fixed point of I'g will also be II?. 


Exercise 5.10 Let L be a finite first order language with equality as the only 
relation symbol. Let 2 be a countable L-structure. 

We say that 2 is computable if there is an onto map a: N > A and computable 
functions f : N* — N for each function symbol f of arity k such that 


a(f(mi,---,7%)) = F*(a(nz),...,a(nz)) for each function symbol f. 
{(n,m) ; a(n) = a(m)} is ce. 


Warning! Our definition is customized to this exercise, and the term ‘com- 
putable structure’ may be used in a different way elsewhere in the literature. 


a) Trivial! Show that there exist computable fields of any characteristics. 


b) Harder! Show that the algebraic closure of a computable field is itself 
computable. 


c) Hard! Is there a computable field such that the set of irreducible polynomi- 
als is not computable? What is the best you can say about the complexity 
of the set of irreducible polynomials in a computable field? 


Exercise 5.11 Recall that a well ordering will be a total ordering (X,<) such 
that any nonempty subset of X will have a least element. An initial segment of 
a well ordering (X, <) will be a subset Y of X such that ye YAr<ySavey, 
together with the ordering < restricted to Y. 


a) Show that an initial segment of a well ordering is itself a well ordering. 


b) Show that if (X1,<1) and (X2, <2) are two well orderings and 7 and 72 
are two isomorphisms between initial segments of (X1,<,) and (X2, <2) 
resp. , then 71(x) = 72(x) if they are both defined. 

Hint: Assume not. Consider the maximal initial segment where they 
agree, and show that they must agree on the ‘next’ element as well. 


c) Show that if (X1,<1) and (X2, <2) are two well orderings then they are 
either isomorphic or one is isomorphic to a proper subset of the other. In 
all cases the isomorphism is unique. 

Hint: Let 7 be the union of all isomorphisms between initial segments of 
the two well orderings. Then 7 is the unique isomorphism in question. 


138 


An ordinal number will be a set a such that 


a is transitive, i.e. if 6 € a then ( is a set and if moreover y € (3, then 
yea. 


(a, €) is a well ordering. 


d) Show that the empty set @ is an ordinal number, and that ) € a for all 
non-empty ordinal numbers. Describe the three smallest ordinal numbers. 


e) Show that if a is an ordinal number and 6 € a, then ( is an ordinal 
number. 


f) Show that if a and 6 are two ordinal numbers, then a = (3, a € £ or 
Bea. 
Hint: Show that identity functions are the only possible isomorphisms 
between initial segments of ordinal numbers. 


g) Show that if a is an ordinal number, then a + 1 defined as aU {a} is an 
ordinal number. 


h) Show that the union of any set of ordinal numbers is an ordinal number. 


A binary relation (X,R) is well founded if each nonempty subset Y of X has 
an R-minimal element x, i.e. an element x € Y such that y ¢ Y whenever yRz. 


i) Let (X,R) be a well founded relation. Let R* be the transitive (but not 
reflexive) closure of R. Show that R* is a well founded relation. 


j) Show that if (X, R) is a well founded relation, there is a unique function 
p mapping X into the ordinal numbers such that 


Va € X (p(x) =({oly) +1; yR*2}). 


Hint: Use well foundedness to show that there is a maximal partial solution 
to the function equation for p, and once again to show that this maximal 
solution will be defined on all of X. 


The function p will be called the rank function of R. The image of p will be an 
ordinal number and will be called the rank of R. 


Exercise 5.12 This exercise requires the combined knowledge of some ax- 
iomatic set theory, standard computability theory and computability in 7F. 

Let X be a hereditarily countable set, A a binary relation on B C N. We say 
that A is a code for X if B, A is isomorphic to the transitive closure of {X}UX 
with the €-relation. 


a) Show that the set of codes is IT}. 
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b) 


c) 


Show that equality between the corresponding sets is a property of the 
codes computable in 7£. (Given two codes, 7E can decide if they are 
codes for the same element or not.) 

Hint: Use the recursion theorem for ?E. 


Show that there are operators on the codes that are computable in ?7E 
and that simulates finite unions, unordered pairs and set subtraction on 
sets. 


An ordinal number a is called computable if a has a computable code. Observe 
that if a is computable and @ < a, then 3 is computable. 


d) 


Show that if a is a computable ordinal number, then every X € L(a) will 
have a hyperarithmetical code. 
(Here L(q) is level no. a in Gédel’s hierarchy of constructible objects.) 


Let w?* (omega-1-Church-Kleene) be the least non-computable ordinal. 


e) 


h) 


Show that the ordinal rank of a computable well founded tree will be 
computable. 

Hint: Show that the partial ordering 0 < 7 & 7 = o on a computable 
well founded tree can be extended to a total, computable well ordering by 
combining it with the lexicographical ordering. (This combined ordering, 
if constructed correctly, is called the Kleene-Brouwer ordering). 


Show that the ordinal rank of any At-well founded tree will be a com- 
putable ordinal. 

Hint: Show that if this is not the case, we may use stage comparison 
to show that the set of computable well founded trees is computable in 
?E. This will contradict that this set is uniformly II} and that there are 
Il}-sets that are not hyperarithmetical. 


Show that every hyperarithmetical set is in L(w?*). 
Hint: Show that the set of computations of length < a, where the argu- 
ments may be ?7F and numbers, will be in L(a + 1). 


Show that L(wf*) satisfies Ag-comprehension and replacement. 
Hint: For the replacement axiom you may find Gandy selection useful. 


If you have worked through all items in this rather extensive exercise, you have 
proved that L(w?*) is a model of what is known as Kripke-Platek set theory, 
a much studied fragment of set theory. 


Exercise 5.13 Work out all the details in Section 4.5.3 


Exercise 5.14 Recall that a tree is a set of finite sequences, and that a tree 
will be well founded if there is no infinite branch. 

A proof tree in an w-logic T will be a well founded tree T of sequences (¢0,.--; On) 
of formulas ¢; with n > 0 such that 
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e If (¢0,..-,¢n) and (¢9,...,¢/,,) are in T, then oo = $9. 
e If (¢0,.--;¢n) € T is a leaf node, then ¢, is an axiom in T. 


e If (¢0,..-;¢n) € T is not a leaf node, then ¢, is a consequence of 
{3 (@0,---,¢n,¢) € T} by the rules of w-logic. 


By a suitable Gédel numbering, we may view the proof trees as trees over the 
natural numbers. 
If JT is a proof tree, we call it an w-proof for do. 


a) Show that for any formula ¢, T'F,, ¢ if and only if there is an w-proof for 
d. 


b) One technical obstacle in w-logic is that an w-proof may be infinite and 
consume all variables. Show that this is a minor obstacle, i.e. show that 


whenever 41,...,Yn are substitutable for 71,...,%, ind, and T F,, ¢, then 
Dre Oy an 


Hint: Use induction on the ordinal rank of the proof tree. 
Show that the proof tree of T7’1»4'" is computable in 2E, the proof tree 
for @ and the set of Gédel numbers for the axioms in T. 


c) Prove the theorem of constants for w-logic in detail. 


Exercise 5.15 In the sections on model theory we focused on countable mod- 
els and on finite models. This is of course only a part of model theory. In 
this exercise we will consider a tiny fragment of model theory for uncountable 
structures. 


Definition 5.1 Let Z be a first order language and « a cardinal number. We 
say that an L-structure % is «-saturated if for all sets B C A = || of cardinality 
<« and all 1-types X in L[B] (relative to the theory of L[B]-sentences true in 
Ql) are realized in 2. is fully saturated if 2 is K-saturated where « is the 
cardinality of |2U]. 


Our definition of saturated corresponds to w-saturated. 


a) Show that for any consistent theory T on L and cardinal number « there 
exists a «-saturated model for T. 


b) Assume the existence of an unbounded class of inaccessible cardinals. 
Show that for any consistent theory T there exists a fully saturated model 
for T of arbitrary large cardinality. 
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Chapter 6 


Appendix: 
Some propositions from a 
beginners course in logic 


Proposition 6.1 The Completeness Theorem, two versions 


a) Let T be a first order theory. Then T is consistent if and only if T has a 
model. 


b) Let T be a first order theory over the language L and let @ be a formula 
in L. Then 
TFroselEe¢ 


Proposition 6.2 The Compactness Theorem 
Let T be a first order theory. Then T has a model if and only if each finite 
subtheory Ty of T has a model. 


Proposition 6.3 The Deduction Theorem 
Let T be a first order theory over a language L, let ¢ be a sentence in L and w 
a formula in L. Then 


T.d+veTHo—y. 


Proposition 6.4 The Theorem of Constants 

Let T be a first order theory over a language L and let c,,...,Cpn be constants 
that are not in L. Let L’ be the extended language. 

Let T’ be the theory over L' with the same non-logical axioms as T, and let 
be a formula in L. Then for all lists 11,...,%n of variables 


TE PST! b pitrte, 
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